SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION
This application is a continuation-in-part of and claims
the benefit of U.S. Provisional Application No. 60/267,231,
filed February 7, 2001, U.S. Provisional Application No.
60/205,839, filed May 19, 2000 and U.S. Provisional
Application No. 60/185,667, filed February 29, 2000, the
contents of which are hereby incorporated by reference into
this application.

The invention disclosed herein was made with Government 
support under NIH Grant Nos. R01AI43847 (T.D.) and
R01DK54718 (T.P.S.) from the Department of Health and Human 
Services. Accordingly, the government has certain rights in 
this invention. 

Throughout this application, various publications are
referenced within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citations for these references may be found 
immediately preceding the claims. 

Background of the Invention 

HIV-1 entry into target cells is mediated by the successive
interaction of the envelope glycoprotein gp120 with CD4 and
a co-receptor belonging to the seven-transmembrane G
protein-coupled chemokine receptor family (Berger et al.
Ann. Rev. Immunol. 17:657, 1999). Binding of gp120 to CD4
exposes or creates a co-receptor binding site on gp120
(Trkola et al. Nature 384:184, 1996; Wu et al. Nature
384:179, 1996). CCR5 and CXCR4 are the most physiologically
relevant and widely used HIV-1 co-receptors (Zhang and
Moore, J. Virol. 73:3443, 1999). CCR5 mediates the entry of
R5 isolates and CXCR4 mediates the entry of X4 isolates.
R5X4 isolates are able to exploit both co-receptors (Berger
et al. Ann. Rev. Immunol. 17:657, 1999). It has been
demonstrated that specific amino acids, including acidic
residues and tyrosines located within the CCR5
amino-terminal domain (Nt, amino acids 2-31), are essential
for CCR5-mediated fusion and entry of R5 and R5X4 HIV-1
strains (Dragic et al. J. Virol. 72:279, 1998; Rabut et al.
J. Virol. 72:3464, 1998; Farzan et al. J. Virol. 72:1160,
1998; Dorantz et al. J. Virol. 71:6305, 1997). More
recently, Farzan et al. demonstrated that tyrosine residues
in the CCR5 Nt are sulfated (Farzan et al. Cell 96:667,
1999).

Inhibition of cellular sulfation pathways, including
tyrosine sulfation, by sodium chlorate decreased the
binding of a gp120/CD4 complex to CCR5+ cells (Farzan et al.
Cell 96:667, 1999). A number of prior reports had
implicated a role for sulfate moieties in HIV-1 entry.
Several sulfated compounds, such as dextran sulfate, can
inhibit HIV-1 entry by associating with CD4 or gp120
(Baeuerle and Huttner J. Cell Biol. 105:2655, 1987; Baba et
al. Proc. Natl. Acad. Sci. USA 85:6132, 1998). Sulfated
proteoglycans have been shown to bind to HIV-1 gp120 at or
near its third variable (V3) loop, which also determines
co-receptor usage (Roderiques et al. J. Virol. 69:2233,
1995; Hwang et al. Science 253:71, 1991). It is therefore
conceivable that sulfo-tyrosines in the CCR5 Nt also
interact with gp120, increasing its affinity for CCR5. The
reduction in gp120/CD4 binding caused by the pre-treatment
of target cells with sodium chlorate, however, cannot be
formally attributed to a reduction in CCR5 tyrosine
sulfation since chlorate can inhibit the sulfation of both
tyrosines and proteoglycans.

The region of the CCR5 Nt spanning amino acids 2-18
contains residues that are critically important for viral
entry (Dragic et al. J. Virol. 72:279, 1998; Rabut et al.
J. Virol. 72:3464, 1998; Farzan et al. J. Virol. 72:1160,
1998; Dorantz et al. J. Virol. 71:6305, 1997). We
previously demonstrated that tyrosines at positions 3, 10
and 14 were required for optimal co-receptor function,
whereas the Tyr15Phe substitution had little effect on
entry (Rabut et al. J. Virol. 72:3464, 1998). Taken
together, these findings suggested that HIV-1 entry may be
critically dependent upon sulfation of Tyr-3, -10 and -14,
but not Tyr-15. We therefore explored the role of
sulfo-tyrosines in positions 3, 10 and 14 by synthesizing
peptides corresponding to amino acids 2-18 of the CCR5 Nt
and carrying different tyrosine modifications. We first
tested the ability of the Nt peptides to inhibit binding of
gp120/CD4 complexes and anti-CCR5 MAbs to CCR5+ cells. The
specific association of certain peptides with gp120/sCD4
complexes or with anti-CCR5 MAbs was further confirmed by
surface plasmon resonance (BIAcore) analysis. Inhibition of
HIV-1 entry by the CCR5 Nt peptides was also tested. Our
results suggest that amino acids 2-18 of the CCR5 Nt
compose a gp120-binding site that determines the
specificity of the interaction between CCR5 and gp120s from
R5 and R5X4 isolates. Post-translational sulfation of the
tyrosine residues in the CCR5 Nt is required for gp120
binding and may critically modulate the susceptibility of
target cells to HIV-1 infection in vivo.

CCR5's normal physiologic activities involve binding and
transducing signals mediated by CC-chemokines, including
RANTES, MIP-1α and MIP-1β, which direct activation and
trafficking of T cells and other inflammatory cells. As
such, CCR5 plays an important role in mediating the
inflammatory reaction of diseases such as rheumatoid
arthritis and multiple sclerosis. The synovial fluid of
rheumatoid arthritis patients is highly enriched in
CCR5-expressing T cells (Qin et al. J Clin Invest 101:746,
1998), and CCR5 is the predominant CC chemokine receptor
expressed on T cells in the rheumatoid synovium
(Gomez-Reino et al. Arthritis Rheum 42:989, 1999). Similarly,
infiltration by CCR5-expressing cells is characteristic of
plaque lesions in patients with multiple sclerosis
(Balashov et al. Proc Natl Acad Sci USA 96:6873, 1999).
Such observations provide a rationale for the use of agents
that block CCR5 for therapy of inflammatory/autoimmune
diseases, including but not limited to arthritis, multiple
sclerosis, asthma, psoriasis, autoimmune diabetes,
transplant rejection, and atherosclerosis.

Summary of the Invention

This invention provides a compound comprising the
structure:

θαYDINYYTSEβλ

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction; wherein β
represents from 0 to 13 amino acids, with the proviso that
if there are more than 2 amino acids, they are joined by
peptide bonds in consecutive order and have a sequence
identical to the sequence set forth in SEQ ID NO: 1
beginning with the P at position 19 and extending therefrom
in the carboxy terminal direction; wherein θ represents an
amino group or an acetylated amino group; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds; further provided that at least
two tyrosines in the compound are sulfated.
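
As a reading aid, the residue ranges that the amino-terminal extension (α) and the carboxy-terminal extension (β) may cover can be enumerated programmatically. The sketch below is illustrative only. It assumes that residues 1-31 of SEQ ID NO: 1 match the publicly available human CCR5 amino-terminal sequence (an assumption, since the sequence listing is not reproduced in this section). The names CCR5_NT, alpha_segment, beta_segment and build_backbone are hypothetical, and terminal groups and tyrosine sulfation are not modeled.

```python
# Assumed residues 1-31 of human CCR5 (public sequence record; NOT taken
# from this text). Positions quoted in the text are 1-based.
CCR5_NT = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"
CORE = "YDINYYTSE"  # residues 10-18, the fixed core of the structure


def alpha_segment(n: int) -> str:
    """Return the n residues ending at Ile-9, extending toward the amino terminus."""
    if not 0 <= n <= 9:
        raise ValueError("alpha spans 0 to 9 residues")
    return CCR5_NT[9 - n:9]


def beta_segment(n: int) -> str:
    """Return the n residues starting at Pro-19, extending toward the carboxy terminus."""
    if not 0 <= n <= 13:
        raise ValueError("beta spans 0 to 13 residues in this sketch")
    return CCR5_NT[18:18 + n]


def build_backbone(n_alpha: int, n_beta: int) -> str:
    """Concatenate alpha, the core and beta into one peptide backbone."""
    return alpha_segment(n_alpha) + CORE + beta_segment(n_beta)


if __name__ == "__main__":
    # Eight alpha residues and no beta residues span positions 2-18.
    print(build_backbone(8, 0))
```

Under these assumptions, an α of eight residues with an empty β corresponds to the 2-18 peptide discussed in the Background.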

This invention also provides a compound comprising the
structure:

θαYDINYYTSEβλ

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction; wherein β
represents from 0 to 333 amino acids, with the proviso that
if there are more than 2 amino acids, they are joined by
peptide bonds in consecutive order and have a sequence
identical to the sequence set forth in SEQ ID NO: 1
beginning with the P at position 19 and extending therefrom
in the carboxy terminal direction; wherein θ represents an
amino group or an acetylated amino group; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds; further provided that at least
two tyrosines in the compound are sulfated.






This invention provides a composition which comprises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
receptor on the surface of a CD4+ cell. 



This invention provides a method of inhibiting human
immunodeficiency virus infection of a CD4+ cell which also
carries a CCR5 receptor on its surface which comprises
contacting the CD4+ cell with an amount of one of the
compounds described herein effective to inhibit binding of
human immunodeficiency virus to the CCR5 receptor so as to
thereby inhibit human immunodeficiency virus infection of
the CD4+ cell.

This invention provides a method of preventing CD4+ cells
of a subject from becoming infected with human
immunodeficiency virus which comprises administering to the
subject an amount of one of the compounds described herein
effective to inhibit binding of human immunodeficiency
virus to CCR5 receptors on the surface of the CD4+ cells so
as to thereby prevent the subject's CD4+ cells from
becoming infected with human immunodeficiency virus.

This invention provides a method of treating a subject
whose CD4+ cells are infected with human immunodeficiency
virus which comprises administering to the subject an
amount of one of the compounds described herein effective
to inhibit binding of human immunodeficiency virus to CCR5
receptors on the surface of the subject's CD4+ cells so as
to thereby treat the subject.

This invention provides a method of identifying an agent
which inhibits binding of a CCR5 ligand to a CCR5 receptor
which comprises:

(a) immobilizing one of the compounds described herein on
a solid support;

(b) contacting the immobilized compound from step (a) with
sufficient detectable CCR5 ligand to saturate all
binding sites for the CCR5 ligand on the immobilized
compound under conditions permitting binding of the
CCR5 ligand to the immobilized compound so as to form
a complex;

(c) removing any unbound CCR5 ligand;

(d) contacting the complex from step (b) with the agent;
and

(e) detecting whether any CCR5 ligand is displaced from
the complex, wherein displacement of detectable CCR5
ligand from the complex indicates that the agent binds
to the compound so as to thereby identify the agent as
one which inhibits binding of the CCR5 ligand to the
CCR5 receptor.

This invention provides a method of identifying an agent
which inhibits binding of a CCR5 ligand to a CCR5 receptor
which comprises:

(a) contacting one of the compounds described herein with
sufficient detectable CCR5 ligand to saturate all
binding sites for the CCR5 ligand on the compound
under conditions permitting binding of the CCR5 ligand
to the compound so as to form a complex;

(b) removing any unbound CCR5 ligand;

(c) measuring the amount of CCR5 ligand which is bound to
the compound in the complex;

(d) contacting the complex from step (a) with the agent so
as to displace CCR5 ligand from the complex;

(e) measuring the amount of CCR5 ligand which is bound to
the compound in the presence of the agent; and

(f) comparing the amount of CCR5 ligand bound to the
compound in step (e) with the amount measured in step
(c), wherein a reduced amount measured in step (e)
indicates that the agent binds to the compound so as
to thereby identify the agent as one which inhibits
binding of the CCR5 ligand to the CCR5 receptor.
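
The comparison in the final step of the displacement method above can be sketched in a few lines. This is a minimal illustration rather than a prescribed analysis: the function names and the min_fraction noise margin are hypothetical, and the text itself requires only that the amount measured in the presence of the agent be reduced.

```python
def fraction_displaced(bound_before: float, bound_after: float) -> float:
    """Fraction of bound ligand lost after the agent is added.

    bound_before: amount bound before the agent is added (step (c)).
    bound_after: amount bound in the presence of the agent (step (e)).
    """
    if bound_before <= 0:
        raise ValueError("bound_before must be positive")
    return (bound_before - bound_after) / bound_before


def inhibits_binding(bound_before: float, bound_after: float,
                     min_fraction: float = 0.0) -> bool:
    """Return True when the bound amount is reduced by more than min_fraction.

    min_fraction is a hypothetical margin for assay noise; the default of 0.0
    flags any reduction at all.
    """
    return fraction_displaced(bound_before, bound_after) > min_fraction
```

In practice a nonzero margin would be chosen from replicate measurements, so that ordinary assay variation is not read as displacement.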

This invention also provides a method of identifying an
agent which inhibits binding of a CCR5 ligand to a CCR5
receptor which comprises:

(a) immobilizing one of the compounds described herein on
a solid support;

(b) contacting the immobilized compound from step (a) with
the agent and sufficient detectable CCR5 ligand to
saturate all binding sites for the CCR5 ligand on the
compound under conditions permitting binding of the
CCR5 ligand to the immobilized compound so as to form
a complex;

(c) removing any unbound CCR5 ligand;

(d) measuring the amount of detectable CCR5 ligand which
is bound to the immobilized compound in the complex;

(e) measuring the amount of detectable CCR5 ligand which
binds to the immobilized compound in the absence of
the agent;

(f) comparing the amount of CCR5 ligand which is bound to
the immobilized compound in step (e) with the amount
measured in step (d), wherein a reduced amount
measured in step (d) indicates that the agent binds to
the compound so as to thereby identify the agent as
one which inhibits binding of the CCR5 ligand to the
CCR5 receptor.

This invention also provides a method of identifying an
agent which inhibits binding of a CCR5 ligand to a CCR5
receptor which comprises:

(a) contacting one of the compounds described herein with
the agent and sufficient detectable CCR5 ligand to
saturate all binding sites for the CCR5 ligand on the
compound under conditions permitting binding of the
CCR5 ligand to the compound so as to form a complex;

(b) removing any unbound CCR5 ligand;

(c) measuring the amount of detectable CCR5 ligand which
is bound to the compound in the complex;

(d) measuring the amount of detectable CCR5 ligand which
binds to the compound in the absence of the agent;

(e) comparing the amount of CCR5 ligand which is bound to
the compound in step (c) with the amount measured in
step (d), wherein a reduced amount measured in step
(c) indicates that the agent binds to the compound so
as to thereby identify the agent as one which inhibits
binding of the CCR5 ligand to the CCR5 receptor.

This invention provides a method of identifying an agent
which inhibits binding of a CCR5 ligand to a CCR5 receptor
which comprises:

a) immobilizing one of the compounds described
herein on a solid support;

b) contacting the immobilized compound from step a)
with the agent dissolved or suspended in a known
vehicle and measuring the binding signal
generated by such contact;

c) contacting the immobilized compound from step a)
with the known vehicle in the absence of the
compound and measuring the binding signal
generated by such contact;

d) comparing the binding signal measured in step b)
with the binding signal measured in step c),
wherein an increased amount measured in step b)
indicates that the agent binds to the compound so
as to thereby identify the agent as one which
binds to the CCR5 receptor.



This invention provides a method of obtaining a composition 
which comprises: 

(a) identifying a compound which inhibits binding of a
CCR5 ligand to a CCR5 receptor according to one of the
above methods; and

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 



This invention provides a compound having the structure:

A-(αYDINYYTSEβλ)n

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.



This invention also provides a compound having the
structure:

(θαYDINYYTSEβ)n-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein θ
represents an amino group or an acetylated amino group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

This invention provides a compound having the structure:

A-(αYDINYYTSEβλ)n

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 333 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

This invention also provides a compound having the
structure:

(θαYDINYYTSEβ)n-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 333 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein θ
represents an amino group or an acetylated amino group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

Brief Description of the Figures



Fig. 1 Effect of peptides on gp120JR-FL binding to CCR5.

L1.2-CCR5+ cells were incubated with the
biotinylated gp120JR-FL/CD4-IgG2 complex in the
presence of different concentrations of peptides
(a) S-3/10/14, S-10/14, S-10, S-14 or (b) P-3/10/14,
P-10/14, SR-2/12, SR-10/14, TS-10/14.
The extent of complex binding in the absence of
peptide was defined as 100% (m.f.i. ~40 ± 5).
Binding in the presence of peptide is expressed
as a percentage of control. When CCR5-negative
cells were used, binding of the gp120JR-FL/CD4-IgG2
complex was negligible (~10%, m.f.i. ~2 ± 1). The
values shown are from a representative
experiment.



Fig. 2 Binding of the gp120/sCD4 complex to sulfated and
phosphorylated peptides.

Biotinylated peptides were immobilized on a
sensor chip and their ability to associate with
gp120/sCD4 was analyzed by BIAcore. RU values as
a function of time were measured in the absence
of peptide (gray dotted lines), in the presence
of phosphorylated peptide (black dotted lines) or
in the presence of sulfated peptide (solid black
lines). We performed binding analyses with the
following proteins: (a) gp120JR-FL/sCD4, (b)
gp120JR-FL, (c) sCD4, (d) ΔV3gp120JR-FL/sCD4, (e)
gp120DH123/sCD4, (f) gp120DH123, (g) gp120LAI/sCD4 and
(h) gp120LAI.



Fig. 3 Effect of peptides on MAb binding to CCR5.

L1.2-CCR5+ cells were incubated with the anti-CCR5
MAbs in the presence of peptides. The extent of
MAb binding in the absence of peptide was defined
as 100% (m.f.i. ~50-400, depending on the MAb).
Binding in the presence of peptide is expressed
as a percentage of control. When CCR5-negative
cells were used, binding of MAbs was negligible
(m.f.i. ~2 ± 1). Each data point represents the
mean ± s.d. of three replicates.

Fig. 4 Binding of MAbs to sulfated and phosphorylated
peptides.

Biotinylated peptides were immobilized on a
sensor chip and their ability to associate with
anti-CCR5 MAbs was analyzed by BIAcore. RU values
as a function of time were measured in the
absence of peptide (gray dotted lines), in the
presence of phosphorylated peptide (black dotted
lines) or in the presence of sulfated peptide
(solid black lines). We performed binding
analyses with (a) PA8, (b) PA10 and (c) 2D7.


Fig. 5 Effect of peptides on viral entry.

HeLa-CD4+CCR5+ cells were infected with NLluc+env-
pseudotyped with different viral envelopes in the
presence of peptides. Luciferase activity
(r.l.u.) was measured 48 h post-infection. The
extent of entry in the absence of peptide was
defined as 100% (r.l.u. ~25,000 ± 9,000).
Background r.l.u. values were ~7 ± 2. Each data
point represents the mean ± s.d. of three
replicates.

Fig. 6: CCR5 Nt peptide sequences and labels

The primary sequence of each peptide is indicated
in the left column and the corresponding label is
indicated in the right column. Sulfated tyrosine
residues are designated by black boxes and white
boxes designate phosphorylated tyrosine residues.

Fig. 7: Gp120/CD4 complex binding to CCR5 Nt
sulfopeptides

Peptide 2-18 was bound to streptavidin-coated
biosensor chips and gp120JR-FL/sCD4 (dotted line)
or gp120JR-FL/CD4-IgG2 (solid line) were flowed over
the sensor chip surface. Resonance units (RU)
were measured as a function of time using a
Biacore X and reflect complex-peptide binding
(a). Sulfopeptide 2-18 (solid symbols) or
phosphopeptide 2-18 (P) (clear symbols) were
immobilized on streptavidin-coated ELISA plates
and incubated with gp120/CD4-IgG2 complexes. Gp120
proteins were derived from the R5 isolate JR-FL
(squares), the R5X4 isolate DH123 (circles) and
the X4 isolate LAI (diamonds). Complex-peptide
binding was detected by an HRP-conjugated goat
anti-human IgG antibody. O.D. at 450 nm was
measured after addition of HRP substrate and is
expressed as a function of CD4-IgG2 concentration
(b). Biotinylated sulfopeptide 2-18 was
immobilized on streptavidin-coated plates and
incubated with gp120/CD4-IgG2 complex in the
presence of increasing concentrations of: PA8
(solid squares), TAK-779 (triangles), RANTES
(inverted triangles), MIP-1α (diamonds), MIP-1β
(circles) or SDF-1 (clear squares). Binding of
the complexes to the peptide was detected by
incubation with HRP-conjugated goat anti-human
IgG antibody. O.D. at 450 nm was measured after
addition of HRP substrate and percentage of
binding was expressed as a function of inhibitor
concentration.

Fig. 8: Binding of anti-CCR5 MAbs to CCR5 Nt peptides.

Sulfopeptides (a) or phosphopeptides (b) were
immobilized on streptavidin-coated ELISA plates
and incubated with anti-CCR5 MAbs PA8 (solid
squares), PA10 (clear circles), PA11 (solid
circles), PA12 (solid diamonds) or PA14 (solid
triangles). Binding of the MAbs to the peptides
was detected by an HRP-conjugated goat anti-mouse
IgG antibody. O.D. at 450 nm was measured after
addition of HRP substrate and expressed as a
function of MAb concentration.

Fig. 9: Binding of gp120JR-FL/CD4-IgG2 to different CCR5
Nt-based peptides.

Streptavidin plates were coated with 2-18 (black
squares), 10-18 (black circles), 8-15 (black
diamonds), 6-16 (black stars), 10-15 (white
square), 10-18 (11A/18A) (black triangles). Plates
were then incubated with gp120JR-FL/CD4-IgG2
complex. Binding of the complex to the peptide
was detected by an HRP-conjugated goat anti-human
IgG antibody. O.D. at 450 nm was measured after
addition of HRP substrate and expressed as a
function of CD4-IgG2 concentration (nM).

Fig. 10: Inhibition of gp120/CD4-IgG2 complex binding to
sulfo-peptides by anti-gp120 MAbs

Biotinylated sulfopeptide 2-18 was bound to
streptavidin-coated biosensor chips and solutions
of either gp120JR-FL/CD4-IgG2 complex (black bars)
or gp120DH123/CD4-IgG2 complex (white bars) were
flowed over the surface of the chip in the
presence of different anti-gp120 MAbs. The names
of the MAbs and the location of their epitopes
are indicated along the x-axis. Resonance units
(RU) were measured as a function of time using a
Biacore X and reflect complex-peptide binding in
the presence of the MAbs. Gp120/CD4-IgG2 binding
was calculated using the formula: (RU in the
presence of MAbs) / (RU in the absence of MAbs)
x 100%. The values shown are from a sample
experiment.
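
The normalization formula quoted in this legend can be written as a one-line calculation. The sketch below simply restates that formula; the function name and the example readings in the usage note are hypothetical and are not data from the figure.

```python
def percent_binding(ru_with_mab: float, ru_without_mab: float) -> float:
    """Return (RU in the presence of MAb) / (RU in the absence of MAb) x 100%."""
    if ru_without_mab <= 0:
        raise ValueError("the reference RU value must be positive")
    return ru_with_mab / ru_without_mab * 100.0
```

For instance, hypothetical readings of 50 RU with a MAb and 200 RU without it would be reported as 25% binding.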



Fig. 11: Binding of gp120 mutants to sulfo-peptide and
wild type CCR5.

Sulfo-peptide 2-18 was immobilized on
streptavidin-coated plates and incubated with a
mixture of gp120-containing supernatants and
CD4-IgG2. Peptide-complex binding was detected by an
HRP-conjugated goat anti-human IgG antibody. O.D.
at 450 nm was measured after addition of HRP
substrate and normalized for binding of the gp120
mutants to CD4-IgG2. The dotted line represents the
normalized value for the binding of the wild type
gp120 to the peptide. The mutated amino acids and
their locations in gp120 are indicated along the
x-axis (a). L1.2-CCR5+ cells were incubated with a
mixture of gp120-containing supernatants and
CD4-IgG2. Binding of the complex was detected by FACS
analysis after addition of streptavidin-PE.
Percentage of gp120/CD4-IgG2 binding to CCR5 was
normalized for gp120 binding to CD4-IgG2. The
dotted line represents the normalized value for
the binding of wild-type gp120 to the L1.2-CCR5+
cells. The mutated amino acids and their
locations in gp120 are indicated along the x-axis
(b).

Fig. 12: Amino acid sequences of CCR5 Nt-based peptides.

The peptides are named according to the positions
of their first and last residues in the
full-length sequence of CCR5. They contain either
sulfotyrosines (black boxes) or phosphotyrosines
(white boxes) in positions 10 and 14. Residues
Asp-11 and Glu-18 in peptide 10-18 (11A/18A) are
substituted with alanines. All peptides carry a
carboxy terminal GAG spacer followed by a
biotinylated lysine.

Fig. 13: Amino acid conservation among R5 isolates.

Envelope sequences from 25 R5 strains described
in the HIV Database and retrieved from the
National Center for Biotechnology Information
GenBank were aligned and percentage of
conservation for the indicated residues was
calculated and combined with results from Hung et
al., 1999 (REF). Alanine mutants showing more
than 50% decrease in sulfopeptide 2-18 binding
compared to the wild type are highlighted in
gray.
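
One plausible way to compute a per-residue percentage of conservation from aligned sequences is sketched below. The legend does not state how the percentage was calculated, so this is an assumption: conservation is taken as the share of sequences carrying the most common residue at an alignment column. The function name and the toy alignment in the usage note are hypothetical and are not the 25 R5 envelope sequences.

```python
from collections import Counter
from typing import Sequence


def percent_conservation(aligned: Sequence[str], column: int) -> float:
    """Percentage of aligned sequences sharing the most common residue at a column.

    aligned: equal-length, pre-aligned sequences (gaps written as '-').
    column: 0-based alignment column.
    """
    if not aligned:
        raise ValueError("at least one sequence is required")
    residues = [seq[column] for seq in aligned]
    top_count = Counter(residues).most_common(1)[0][1]
    return 100.0 * top_count / len(residues)
```

With the toy alignment ["KRIG", "KRIG", "KTIG", "KRVG"], column 1 holds R in three of four sequences, so the function returns 75.0 for that column.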
Detailed Description of the Invention

The plasmids CD4-IgG2-HC-pRcCMV and CD4-kLC-pRcCMV were
deposited pursuant to, and in satisfaction of, the
requirements of the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the
Purposes of Patent Procedure (the "Budapest Treaty") with
the American Type Culture Collection (ATCC), 10801
University Boulevard, Manassas, Virginia 20110-2209 under
ATCC Accession Nos. 75193 and 75194, respectively.


The plasmids designated PPI4-tPA-gp120JR-FL and
PPI4-tPA-gp120LAI were deposited pursuant to, and in
satisfaction of, the requirements of the Budapest Treaty on
the International Recognition of the Deposit of
Microorganisms for the Purposes of Patent Procedure with
the American Type Culture Collection (ATCC), 10801
University Boulevard, Manassas, Virginia 20110-2209 under
ATCC Accession Nos. 75431 and 75432, respectively. These
plasmids were deposited with ATCC on March 12, 1993. These
eukaryotic shuttle vectors contain the cytomegalovirus major
immediate-early (CMV MIE) promoter/enhancer linked to the
full-length HIV-1 envelope gene whose signal sequence was
replaced with that derived from tissue plasminogen
activator. In the vector, a stop codon has been placed at
the gp120 C-terminus to prevent translation of gp41
sequences, which are present in the vector. The vector
also contains an ampicillin resistance gene, an SV40 origin
of replication and a DHFR gene whose transcription is
driven by the β-globin promoter.



The monoclonal antibodies PA8, PA10, PA11, PA12, and PA14
were deposited pursuant to, and in satisfaction of, the
requirements of the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the
Purposes of Patent Procedure with the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas,
Virginia 20110-2209 on December 2, 1998 under the following
Accession Nos.: ATCC Accession No. HB-12605 (PA8), ATCC
Accession No. HB-12607 (PA10), ATCC Accession No. HB-12608
(PA11), ATCC Accession No. HB-12609 (PA12), and ATCC
Accession No. HB-12610 (PA14).

As used herein, the following standard abbreviations are
used throughout the specification to indicate specific
amino acids:

A=ala=alanine              R=arg=arginine
N=asn=asparagine           D=asp=aspartic acid
C=cys=cysteine             Q=gln=glutamine
E=glu=glutamic acid        G=gly=glycine
H=his=histidine            I=ile=isoleucine
L=leu=leucine              K=lys=lysine
M=met=methionine           F=phe=phenylalanine
P=pro=proline              S=ser=serine
T=thr=threonine            W=trp=tryptophan
Y=tyr=tyrosine             V=val=valine
B=asx=asparagine or aspartic acid
Z=glx=glutamine or glutamic acid

As used herein, the following standard abbreviations are
used throughout the specification to indicate specific
nucleotides: C=cytosine; A=adenosine; T=thymidine;
G=guanosine; and U=uracil.

This invention provides a compound comprising the
structure:

θαYDINYYTSEβλ

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction; wherein β
represents from 0 to 13 amino acids, with the proviso that
if there are more than 2 amino acids, they are joined by
peptide bonds in consecutive order and have a sequence
identical to the sequence set forth in SEQ ID NO: 1
beginning with the P at position 19 and extending therefrom
in the carboxy terminal direction;

wherein θ represents an amino group or an acetylated amino
group; wherein λ represents a carboxyl group or an amidated
carboxyl group; wherein all of α, Y, D, I, N, Y, Y, T, S, E and β
are joined together by peptide bonds; further provided that
at least two tyrosines in the compound are sulfated.
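
The way the amino-terminal extension (α) and the carboxy-terminal extension (β) are drawn from SEQ ID NO: 1 can be illustrated with a minimal sketch. The function name build_peptide and the constant names are hypothetical and are not part of the specification; the sketch stores only the first 31 residues of SEQ ID NO: 1, so it covers β lengths of 0 to 13 only, and it does not model the terminal groups or the sulfation state.

```python
# First 31 residues of CCR5 as listed in SEQ ID NO: 1 of this document.
CCR5_NTERM = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"
CORE = "YDINYYTSE"   # residues 10-18
CORE_START = 9       # 0-based index of Tyr-10
CORE_END = 18        # 0-based index just past Glu-18


def build_peptide(alpha_len: int, beta_len: int) -> str:
    """Return alpha + YDINYYTSE + beta as a one-letter sequence.

    alpha_len residues end at Ile-9 and extend toward the amino terminus;
    beta_len residues start at Pro-19 and extend toward the carboxy terminus.
    """
    if not 0 <= alpha_len <= 9:
        raise ValueError("alpha must be 0 to 9 residues")
    if not 0 <= beta_len <= 13:
        raise ValueError("this sketch supports beta of 0 to 13 residues")
    alpha = CCR5_NTERM[CORE_START - alpha_len:CORE_START]
    beta = CCR5_NTERM[CORE_END:CORE_END + beta_len]
    return alpha + CORE + beta


if __name__ == "__main__":
    print(build_peptide(0, 0))   # core sequence only
    print(build_peptide(8, 0))   # residues 2-18
    print(build_peptide(9, 13))  # residues 1-31
```

With alpha_len = 8 and beta_len = 0 the sketch yields the sequence of residues 2 to 18, which corresponds to the span of the peptide named 2-18 in the figure descriptions.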

In one embodiment of the above compound, the compound is a
peptide which comprises consecutive amino acids having the
sequence YDINYYTSE.

In one embodiment of the above compound, the tyrosines at 
positions 1 and 5 of the sequence YDINYYTSE are sulfated. 

In one embodiment of the above compound, α represents less
than 9 amino acids. In another embodiment of the above
compound, α represents less than 8 amino acids. In another
embodiment of the above compound, α represents less than 7
amino acids. In another embodiment of the above compound, α
represents less than 6 amino acids. In another embodiment
of the above compound, α represents less than 5 amino
acids. In another embodiment of the above compound, α
represents less than 4 amino acids. In another embodiment
of the above compound, α represents less than 3 amino
acids. In another embodiment of the above compound, α
represents less than 2 amino acids. In another embodiment
of the above compound, α represents less than 1 amino acid.

In one embodiment of the above compound, β represents less
than 17 amino acids. In one embodiment of the above
compound, β represents less than 16 amino acids. In one
embodiment of the above compound, β represents less than 15
amino acids. In one embodiment of the above compound, β
represents less than 14 amino acids. In one embodiment of
the above compound, β represents less than 13 amino acids.
In one embodiment of the above compound, β represents less
than 12 amino acids. In one embodiment of the above
compound, β represents less than 11 amino acids. In one
embodiment of the above compound, β represents less than 10
amino acids. In one embodiment of the above compound, β
represents less than 9 amino acids. In one embodiment of
the above compound, β represents less than 8 amino acids.
In one embodiment of the above compound, β represents less
than 7 amino acids. In one embodiment of the above
compound, β represents less than 6 amino acids. In one
embodiment of the above compound, β represents less than 5
amino acids. In one embodiment of the above compound, β
represents less than 4 amino acids. In one embodiment of
the above compound, β represents less than 3 amino acids.
In one embodiment of the above compound, β represents less
than 2 amino acids. In one embodiment of the above
compound, β represents less than 1 amino acid.

This invention also provides a compound comprising the
structure:

θαYDINYYTSEβλ

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction; wherein β
represents from 0 to 333 amino acids, with the proviso that
if there are more than 2 amino acids, they are joined by
peptide bonds in consecutive order and have a sequence
identical to the sequence set forth in SEQ ID NO: 1
beginning with the P at position 19 and extending therefrom
in the carboxy terminal direction;
wherein θ represents an amino group or an acetylated amino
group; wherein λ represents a carboxyl group or an amidated
carboxyl group; wherein all of α, Y, D, I, N, Y, Y, T, S, E and β
are joined together by peptide bonds; further provided that
at least two tyrosines in the compound are sulfated.

In the compounds described herein and as exemplified above,
the β in each compound may alternatively represent from 0
to 333 amino acids.

In one embodiment of the compounds described herein, β
represents less than 300 amino acids. In another embodiment
of the above compound, β represents less than 250 amino
acids. In another embodiment of the above compound, β
represents less than 200 amino acids. In another embodiment
of the above compound, β represents less than 150 amino
acids. In another embodiment of the above compound, β
represents less than 100 amino acids. In another embodiment
of the above compound, β represents less than 75 amino
acids. In another embodiment of the above compound, β
represents less than 50 amino acids. In another embodiment
of the above compound, β represents less than 40 amino
acids. In another embodiment of the above compound, β
represents less than 35 amino acids. In another embodiment
of the above compound, β represents less than 30 amino
acids. In another embodiment of the above compound, β
represents less than 25 amino acids. In another embodiment
of the above compound, β represents less than 20 amino
acids. In another embodiment of the above compound, β
represents less than 19 amino acids. In another embodiment
of the above compound, β represents less than 18 amino
acids. In another embodiment of the above compound, β
represents less than 17 amino acids. In another embodiment
of the above compound, β represents less than 16 amino
acids. In another embodiment of the above compound, β
represents less than 15 amino acids. In another embodiment
of the above compound, β represents less than 14 amino
acids. In another embodiment of the above compound, β
represents less than 13 amino acids. In another embodiment
of the above compound, β represents less than 12 amino
acids. In another embodiment of the above compound, β
represents less than 11 amino acids.

In one embodiment of the above compound, α represents less
than 9 amino acids. In another embodiment of the above
compound, α represents less than 8 amino acids. In another
embodiment of the above compound, α represents less than 7
amino acids. In another embodiment of the above compound, α
represents less than 6 amino acids. In another embodiment
of the above compound, α represents less than 5 amino
acids. In another embodiment of the above compound, α
represents less than 4 amino acids. In another embodiment
of the above compound, α represents less than 3 amino
acids. In another embodiment of the above compound, α
represents less than 2 amino acids. In another embodiment
of the above compound, α represents less than 1 amino acid.

The CCR5 amino acid sequence is the following and is set
forth in SEQ ID NO: 1:

  1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLV
 41 FIFGFVGNMLVILILINCKRLKSMTDIYLLNLAISDLFFL
 81 LTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFII
121 LLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFAS
161 LPGIIFTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVI
201 LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI
241 MIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQ
281 VTETLGMTHCCINPIIYAFVGEKFRNYLLVFFQKHIAKRF
321 CKCCSIFQQEAPERASSVYTRSTGEQEISVGL 352



The CCR5 nucleotide sequence is the following and is set
forth in SEQ ID NO: 2:

   1 GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT
  61 TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC
 121 TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT
 181 AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAGA
 241 TGGATTATCA AGTGTCAAGT CCAATCTATG ACATCAATTA TTATACATCG GAGCCCTGCC
 301 AAAAAATCAA TGTGAAGCAA ATCGCAGCCC GCCTCCTGCC TCCGCTCTAC TCACTGGTGT
 361 TCATCTTTGG TTTTGTGGGC AACATGCTGG TCATCCTCAT CCTGATAAAC TGCAAAAGGC
 421 TGAAGAGCAT GACTGACATC TACCTGCTCA ACCTGGCCAT CTCTGACCTG TTTTTCCTTC
 481 TTACTGTCCC CTTCTGGGCT CACTATGCTG CCGCCCAGTG GGACTTTGGA AATACAATGT
 541 GTCAACTCTT GACAGGGCTC TATTTTATAG GCTTCTTCTC TGGAATCTTC TTCATCATCC
 601 TCCTGACAAT CGATAGGTAC CTGGCTGTCG TCCATGCTGT GTTTGCTTTA AAAGCCAGGA
 661 CGGTCACCTT TGGGGTGGTG ACAAGTGTGA TCACTTGGGT GGTGGCTGTG TTTGCGTCTC
 721 TCCCAGGAAT CATCTTTACC AGATCTCAAA AAGAAGGTCT TCATTACACC TGCAGCTCTC
 781 ATTTTCCATA CAGTCAGTAT CAATTCTGGA AGAATTTCCA GACATTAAAG ATAGTCATCT
 841 TGGGGCTGGT CCTGCCGCTG CTTGTCATGG TCATCTGCTA CTCGGGAATC CTAAAAACTC
 901 TGCTTCGGTG TCGAAATGAG AAGAAGAGGC ACAGGGCTGT GAGGCTTATC TTCACCATCA
 961 TGATTGTTTA TTTTCTCTTC TGGGCTCCCT ACAACATTGT CCTTCTCCTG AACACCTTCC
1021 AGGAATTCTT TGGCCTGAAT AATTGCAGTA GCTCTAACAG GTTGGACCAA GCTATGCAGG
1081 TGACAGAGAC TCTTGGGATG ACGCACTGCT GCATCAACCC CATCATCTAT GCCTTTGTCG
1141 GGGAGAAGTT CAGAAACTAC CTCTTAGTCT TCTTCCAAAA GCACATTGCC AAACGCTTCT
1201 GCAAATGCTG TTCTATTTTC CAGCAAGAGG CTCCCGAGCG AGCAAGCTCA GTTTACACCC
1261 GATCCACTGG GGAGCAGGAA ATATCTGTGG GCTTGTGACA CGGACTCAAG TGGGCTGGTG
1321 ACCCAGTCAG AGTTGTGCAC ATGGCTTAGT TTTCATACAC AGCCTGGGCT GGGGGT

The YDINYYTSE sequence corresponds to amino acid residues 
10-18 of the CCR5 sequence set forth above. 
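
As a cross-check of the two sequence listings, the stretch of SEQ ID NO: 2 that appears to encode the amino terminus can be translated and compared with SEQ ID NO: 1. The sketch below is illustrative only: it assumes that the open reading frame begins at the ATG spanning nucleotides 240-242 (a reading of the listing, not a statement in the specification), it uses the standard genetic code, and the names translate and ORF_START are hypothetical.

```python
# Standard genetic code, indexed by bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Nucleotides 240-296 of SEQ ID NO: 2 as printed in this document
# (assumed start of the coding region; 57 bases = 19 codons).
ORF_START = "ATGGATTATCAAGTGTCAAGTCCAATCTATGACATCAATTATTATACATCGGAGCCC"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    protein = translate(ORF_START)
    print(protein)
    # 1-based residues 10-18 correspond to the 0-based slice [9:18].
    print(protein[9:18])
```

Under these assumptions the translated fragment reproduces residues 1 to 19 of SEQ ID NO: 1, and the slice for residues 10 to 18 is the YDINYYTSE sequence.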

As used herein, "CCR5" is a chemokine receptor which binds
members of the CC group of chemokines and whose amino acid
sequence comprises that provided in Genbank Accession
Number 1705896 and related polymorphic variants. The
nucleotide sequence comprises that provided in Genbank
Accession Number X91492. In one embodiment, the above
compound may correspond to the extracellular portion of
CCR5. The first 31 amino acids of CCR5 correspond to the
extracellular portion of CCR5. Accordingly, the
extracellular portion extends from the methionine at
position number 1 to the arginine at position number 31 of
SEQ ID NO: 1. In another embodiment, the above compound may
correspond to the amino-terminal portion of CCR5. As used
herein, "N-terminus" or amino-terminus means the sequence of
amino acids spanning the initiating methionine and the
first transmembrane region.

As used herein, "H2N" refers to the N-terminus or amino-
terminus. As used herein, "COOH" refers to the C-terminus
or carboxy-terminus.

Various tyrosines of the compounds described herein may be
sulfated. These include but are not limited to the
tyrosines at positions 3, 10 and 14 of the amino acid sequence
set forth in SEQ ID NO: 1. Accordingly, in one embodiment,
the tyrosines at positions 10 and 14 are sulfated. In
another embodiment, the tyrosines at positions 3 and 14 are
sulfated. In another embodiment, the tyrosines at positions
3 and 10 are sulfated. In another embodiment, the tyrosines
at positions 3, 10 and 14 are sulfated. Other tyrosines in
the sequence set forth in SEQ ID NO: 1 may also be sulfated.
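
The sulfation embodiments listed above can be enumerated with a short sketch. It locates the tyrosines in the first 31 residues of SEQ ID NO: 1 and lists every combination of at least two of the named positions 3, 10 and 14. The function names are hypothetical, and the sketch says nothing about which patterns are active; it only counts the combinations the text describes.

```python
from itertools import combinations

# First 31 residues of CCR5 as listed in SEQ ID NO: 1 of this document.
CCR5_NTERM = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"


def tyrosine_positions(seq: str) -> list:
    """Return the 1-based positions of every tyrosine (Y) in seq."""
    return [pos for pos, aa in enumerate(seq, start=1) if aa == "Y"]


def sulfation_patterns(sites, minimum: int = 2) -> list:
    """Return all combinations of the given sites with at least `minimum` members."""
    patterns = []
    for size in range(minimum, len(sites) + 1):
        patterns.extend(combinations(sites, size))
    return patterns


if __name__ == "__main__":
    print(tyrosine_positions(CCR5_NTERM))
    for pattern in sulfation_patterns((3, 10, 14)):
        print(pattern)
```

The amino-terminal region also contains a tyrosine at position 15, which falls under the statement that other tyrosines in the sequence may also be sulfated.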

This invention provides a composition comprising one of the
compounds described herein and a detectable marker attached
thereto. In one embodiment of the composition, the
detectable marker is biotin. In one embodiment of the
composition, the detectable marker is attached at the
C-terminus of the compound.

The compounds of the subject invention may also be isolated
or purified. In one embodiment the compound is labeled with
a detectable marker. As used herein, chemical "labels"
include radioactive isotopes, fluorescent groups and
affinity moieties such as biotin that facilitate detection
of the labeled peptide. Other chemical labels are well-
known to those skilled in the art. Methods for attaching
chemical labels to peptides are well-known to the skilled
artisan.

As used herein, "peptide" and "polypeptide" are used to
denote two or more amino acids linked by a peptidic bond
between the α-carboxyl group of one amino acid and the
α-amino group of the next amino acid. Peptides may be
produced by solid-phase synthetic methods that are well-
known to those skilled in the art. In addition to the above
set of twenty amino acids that are used for protein
synthesis in vivo, peptides may contain additional amino
acids, including but not limited to hydroxyproline,
sarcosine, and γ-carboxyglutamate. The peptides may contain
modifying groups including but not limited to sulfate and
phosphate moieties. Peptides can be comprised of L- or D-
amino acids, which are mirror-image forms with differing
optical properties. Peptides containing D-amino acids have
the advantage of being less susceptible to proteolysis in
vivo.

Peptides may be synthesized in monomeric linear form,
cyclized form or as oligomers such as branched multiple
antigen peptide (MAP) dendrimers (Tam et al. Biopolymers
51:311, 1999). Nonlinear peptides may have increased
binding affinity by virtue of their restricted
conformations and/or oligomeric nature. Peptides may also
be produced using recombinant methods as either isolated
peptides or as a portion of a larger fusion protein that
contains additional amino acid sequences.

Peptides may be chemically conjugated to proteins by a
variety of well-known methods. Such peptide-protein
conjugates can be formulated with a suitable adjuvant and
administered parenterally for the purposes of generating
polyclonal and monoclonal antibodies to the peptides of
interest. Alternatively, unconjugated peptides can be
formulated with adjuvant and administered to laboratory
animals for the purposes of generating antibodies. Methods
for generating and isolating such antibodies are well-known
to those skilled in the art.

This invention provides derivatives of the above compound.
As used herein, a "derivative" peptide is one whose amino
acid sequence is nonidentical to the reference peptide but
which possesses functionally similar binding properties.
Derivative peptides may also contain N-terminal, C-terminal
and/or internal insertions, deletions, or substitutions of
amino acids, with the proviso that such insertions,
deletions and substitutions do not abrogate the binding
properties of the peptide. Derivative peptides include
peptides modified with chemical labels to facilitate
detection. Derivative peptides include branched and
cyclized peptides.

As used herein, "sulfopeptides" are peptides that contain
sulfate moieties attached to one or more amino acids, such
as tyrosine. In "sulfo-tyrosines", a sulfate group replaces
the para-hydroxyl group located on the tyrosine side-chain.

As used herein, "phosphopeptides" are peptides that contain
phosphate moieties attached to one or more amino acids,
such as tyrosine. In "phospho-tyrosines", a phosphate group
replaces the para-hydroxyl group located on the tyrosine
side-chain.

The peptides of the subject invention may be sulfated when
synthesized or they may be subsequently sulfated. For
example, means of sulfating the peptides include chemical
sulfation or enzymatic sulfation. One skilled in the art
would know how to employ these and other techniques to
sulfate the compound.



This invention provides a composition which comprises a
carrier and an amount of one of the compounds described
herein effective to inhibit binding of HIV-1 to a CCR5
receptor on the surface of a CD4+ cell.



The carriers include but are not limited to an aerosol,
intravenous, oral or topical carrier. Accordingly, the
invention provides the above composition adapted for
aerosol, intravenous, oral or topical application.

This invention provides the above compositions and a
pharmaceutically acceptable carrier. Pharmaceutically
acceptable carriers are well known to those skilled in the
art. Such pharmaceutically acceptable carriers may include
but are not limited to aqueous or non-aqueous solutions,
suspensions, and emulsions. Examples of non-aqueous
solvents are propylene glycol, polyethylene glycol,
vegetable oils such as olive oil, and injectable organic
esters such as ethyl oleate. Aqueous carriers include
water, alcoholic/aqueous solutions, emulsions or
suspensions, saline and buffered media. Parenteral
vehicles include sodium chloride solution, Ringer's
dextrose, dextrose and sodium chloride, lactated Ringer's
or fixed oils. Intravenous vehicles include fluid and
nutrient replenishers, electrolyte replenishers such as
those based on Ringer's dextrose, and the like.
Preservatives and other additives may also be present, such
as, for example, antimicrobials, antioxidants, chelating
agents, inert gases and the like.

As used herein, "composition" means a mixture. The
compositions include but are not limited to those suitable
for oral, rectal, intravaginal, topical, nasal, ophthalmic,
or parenteral administration to a subject. As used herein,
"parenteral" includes but is not limited to subcutaneous,
intravenous, intramuscular, or intrasternal injections or
infusion techniques.

As used herein, "administering" may be effected or performed
using any of the methods known to one skilled in the art.
The methods may comprise intravenous, intramuscular or
subcutaneous means. As used herein, "effective dose" means
an amount in sufficient quantities to either treat the
subject or prevent the subject from becoming infected with
HIV-1. A person of ordinary skill in the art can perform
simple titration experiments to determine what amount is
required to treat the subject.

This invention provides a method of inhibiting human
immunodeficiency virus infection of a CD4+ cell which also
carries a CCR5 receptor on its surface which comprises
contacting the CD4+ cell with an amount of one of the
compounds described herein effective to inhibit binding of
human immunodeficiency virus to the CCR5 receptor so as to
thereby inhibit human immunodeficiency virus infection of
the CD4+ cell. As used herein, "inhibits" means that the
amount is reduced. In a preferred embodiment, inhibits
means that the amount is reduced 100%.

In one embodiment of this method, the CD4+ cell is present
in a subject and the contacting is effected by
administering the compound to the subject.

This invention provides a method of preventing CD4+ cells 
of a subject from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4+ cells so 
as to thereby prevent the subject's CD4+ cells from 
becoming infected with human immunodeficiency virus.

This invention provides a method of treating a subject 
whose CD4+ cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
amount of one of the compounds described herein effective
to inhibit binding of human immunodeficiency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

As used herein, human immunodeficiency virus includes but
is not limited to HIV-1, which is the human
immunodeficiency virus type-1. HIV-1 includes but is not
limited to extracellular virus particles and the forms of
HIV-1 found in HIV-1 infected cells.

As used herein, "HIV-1 infection" means the introduction of
HIV-1 genetic information into a target cell, such as by
fusion of the target cell membrane with HIV-1 or an HIV-1
envelope glycoprotein+ cell. The target cell may be a bodily
cell of a subject. In the preferred embodiment, the target
cell is a bodily cell from a human subject.

As used herein, "inhibiting HIV-1 infection" means the 
reduction of the amount of HIV-1 genetic information 
introduced into a target cell population as compared to the 
amount that would be introduced without the composition. 

In the above methods, the compound may be administered by
various routes including but not limited to aerosol,
intravenous, oral or topical route. The administration may
comprise intralesional, intraperitoneal, intramuscular or
intravenous injection; infusion; liposome-mediated
delivery; topical, intrathecal, gingival pocket, per
rectum, intrabronchial, nasal, oral, ocular or otic
delivery. In a further embodiment, the administration
includes intrabronchial administration, anal, intrathecal
administration or transdermal delivery. In another
embodiment, the compound is administered hourly, daily,
weekly, monthly or annually. In another embodiment, the
effective amount of the compound comprises from about
0.000001 mg/kg body weight to about 100 mg/kg body weight.

The administration may be constant for a certain period of
time or periodic and at specific intervals. The compound
may be delivered hourly, daily, weekly, monthly, yearly
(e.g. in a time release form) or as a one time delivery.
The delivery may be continuous delivery for a period of
time, e.g. intravenous delivery.

The carrier may be a diluent, an aerosol, a topical
carrier, an aqueous solution, a nonaqueous solution or a
solid carrier.

The effective amount of the compound may comprise from
about 0.000001 mg/kg body weight to about 100 mg/kg body
weight. In one embodiment, the effective amount may
comprise from about 0.001 mg/kg body weight to about 50
mg/kg body weight. In another embodiment, the effective
amount may range from about 0.01 mg/kg body weight to about
10 mg/kg body weight. The actual effective amount will be
based upon the size of the compound, the biodegradability
of the compound, the bioactivity of the compound and the
bioavailability of the compound. If the compound does not
degrade quickly, is bioavailable and highly active, a
smaller amount will be required to be effective. The
effective amount will be known to one of skill in the art;
it will also be dependent upon the form of the compound,
the size of the compound and the bioactivity of the
compound. One of skill in the art could routinely perform
empirical activity tests for a compound to determine the
bioactivity in bioassays and thus determine the effective
amount.
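
The per-kilogram ranges above translate into absolute amounts by simple multiplication with body weight. The following sketch is a worked arithmetic illustration only and is not dosing guidance; the function name absolute_dose_mg and the 70 kg example weight are assumptions introduced here, while the two ranges are the broadest and narrowest mg/kg ranges stated in the text.

```python
def absolute_dose_mg(weight_kg: float, low_mg_per_kg: float, high_mg_per_kg: float) -> tuple:
    """Convert a per-kilogram dose range into an absolute range in milligrams."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("lower bound exceeds upper bound")
    return (low_mg_per_kg * weight_kg, high_mg_per_kg * weight_kg)


# Ranges taken from the text, in mg/kg body weight.
BROAD_RANGE = (0.000001, 100.0)
NARROW_RANGE = (0.01, 10.0)

if __name__ == "__main__":
    weight = 70.0  # hypothetical body weight in kg
    for name, (low, high) in (("broad", BROAD_RANGE), ("narrow", NARROW_RANGE)):
        low_mg, high_mg = absolute_dose_mg(weight, low, high)
        print(f"{name}: {low_mg:g} mg to {high_mg:g} mg")
```

The broad range spans eight orders of magnitude, which is why the text leaves the actual amount to empirical activity tests.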

The compound of the present invention may be delivered
locally via a capsule which allows sustained release of the
agent or the peptide over a period of time. Controlled or
sustained release compositions include formulation in
lipophilic depots (e.g., fatty acids, waxes, oils). Also
comprehended by the invention are particulate compositions
coated with polymers (e.g., poloxamers or poloxamines) and
the agent coupled to antibodies directed against tissue-
specific receptors, ligands or antigens or coupled to
ligands of tissue-specific receptors. Other embodiments of
the compositions of the invention incorporate particulate
forms, protective coatings, protease inhibitors or
permeation enhancers for various routes of administration,
including parenteral, pulmonary, nasal and oral.

In one embodiment of the above methods, the subject is 
infected with HIV-1 prior to administering the compound to 
the subject. In one embodiment of the above methods, the 
subject is not infected with HIV-1 prior to administering 
the compound to the subject. In one embodiment of the above 
methods, the subject is not infected with, but has been 
exposed to, human immunodeficiency virus. 

In one embodiment of the above methods, the effective
amount of the compound comprises from about 1.0 ng/kg to
about 100 mg/kg body weight of the subject. In another
embodiment of the above methods, the effective amount of
the compound comprises from about 100 ng/kg to about 50
mg/kg body weight of the subject. In another embodiment of
the above methods, the effective amount of the compound
comprises from about 1 µg/kg to about 10 mg/kg body weight
of the subject. In another embodiment of the above methods,
the effective amount of the compound comprises from about
100 µg/kg to about 1 mg/kg body weight of the subject.

The dose of the composition of the invention will vary
depending on the subject and upon the particular route of
administration used. Dosages can range from 0.1 to 100,000
µg/kg. Based upon the composition, the dose can be
delivered continuously, such as by continuous pump, or at
periodic intervals, for example, on one or more separate
occasions. Desired time intervals of multiple doses of a
particular composition can be determined without undue
experimentation by one skilled in the art.

As used herein, "effective dose" means an amount in 
sufficient quantities to either treat the subject or 
prevent the subject from becoming infected with HIV-1. A 
person of ordinary skill in the art can perform simple
titration experiments to determine what amount is required 
to treat the subject. 

In one embodiment of the above method, the subject is a 
human being. As used herein, "subject" means any animal or 
artificially modified animal capable of becoming HIV- 
infected. Artificially modified animals include, but are 
not limited to, SCID mice with human immune systems. The 
subjects include but are not limited to mice, rats, dogs, 
guinea pigs, ferrets, rabbits, and primates. In the 
preferred embodiment, the subject is a human being. 



This invention provides a vaccine which comprises the
compound described herein. Vaccines comprising the
sulfopeptides and a suitable adjuvant could be administered
to a subject for the purposes of generating antibodies or
other immune responses that are of therapeutic or
prophylactic value. For example, the vaccines could be
administered for the purpose of generating in the subject
antibodies that bind CCR5 and inhibit its ability to
mediate HIV entry and infection, thereby protecting the
subject from HIV infection or disease progression. The
vaccines may also comprise a suitable adjuvant. The vaccine
may also comprise a suitable carrier.

The subject invention has various applications which
include HIV treatment such as treating a subject who has
become afflicted with HIV. As used herein, "afflicted with
HIV-1" means that the subject has at least one cell which
has been infected by HIV-1. As used herein, "treating"
means either slowing, stopping or reversing the progression
of an HIV-1 disorder. In the preferred embodiment,
"treating" means reversing the progression to the point of
eliminating the disorder. As used herein, "treating" also
means the reduction of the number of viral infections,
reduction of the number of infectious viral particles,
reduction of the number of virally infected cells, or the
amelioration of symptoms associated with HIV-1. Another
application of the subject invention is to prevent a
subject from contracting HIV. As used herein, "contracting
HIV-1" means becoming infected with HIV-1, whose genetic
information replicates in and/or incorporates into the host
cells. Another application of the subject invention is to
treat a subject who has become infected with HIV-1. As used
herein, "HIV-1 infection" means the introduction of HIV-1
genetic information into a target cell, such as by fusion
of the target cell membrane with HIV-1 or an HIV-1 envelope
glycoprotein+ cell. The target cell may be a bodily cell of
a subject. In the preferred embodiment, the target cell is
a bodily cell from a human subject. Another application of
the subject invention is to inhibit HIV-1 infection. As
used herein, "inhibiting HIV-1 infection" means reducing
the amount of HIV-1 genetic information introduced into a
target cell population as compared to the amount that would
be introduced without said composition.

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) immobilizing one of the compounds described herein on 
a solid support; 

(b) contacting the immobilized compound from step (a) with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the immobilized 
compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the agent; 
and 

(e) detecting whether any CCR5 ligand is displaced from
the complex, wherein displacement of detectable CCR5
ligand from the complex indicates that the agent binds
to the compound so as to thereby identify the agent as
one which inhibits binding of the CCR5 ligand to the
CCR5 receptor.



This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex;

(b) removing any unbound CCR5 ligand;

(c) measuring the amount of CCR5 ligand which is bound to
the compound in the complex;

(d) contacting the complex from step (a) with the agent so
as to displace CCR5 ligand from the complex;

(e) measuring the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in step 
(c), wherein a reduced amount measured in step (e)
indicates that the agent binds to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 
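
The comparison in steps (c), (e) and (f) above amounts to simple arithmetic on two measurements. The following is a minimal sketch in Python, not part of the disclosed method: the function names and the 20% cutoff are hypothetical and chosen only for illustration, and the signal units are whatever the chosen detectable marker provides.

```python
def fraction_displaced(bound_before: float, bound_after: float) -> float:
    """Fraction of pre-bound CCR5 ligand lost after the agent is added.

    bound_before: ligand signal measured in step (c), before the agent.
    bound_after:  ligand signal measured in step (e), with the agent.
    """
    if bound_before <= 0:
        raise ValueError("bound_before must be positive")
    return (bound_before - bound_after) / bound_before


def reduces_bound_ligand(bound_before: float, bound_after: float,
                         threshold: float = 0.2) -> bool:
    """Step (f): flag the agent when the bound amount drops by at least
    `threshold` (a hypothetical cutoff, not taken from the text)."""
    return fraction_displaced(bound_before, bound_after) >= threshold
```

For instance, a drop from 1000 to 250 signal units corresponds to three quarters of the ligand being displaced, whereas a drop from 1000 to 950 would fall below the illustrative cutoff.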

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) immobilizing one of the compounds described herein on
a solid support;

(b) contacting the immobilized compound from step (a) with
the agent and detectable CCR5 ligand under conditions
permitting binding of the CCR5 ligand to the
immobilized compound so as to form a complex;

(c) removing any unbound CCR5 ligand;

(d) measuring the amount of detectable CCR5 ligand which
is bound to the immobilized compound in the complex;

(e) measuring the amount of detectable CCR5 ligand which
binds to the immobilized compound in the absence of
the agent; and

(f) comparing the amount of CCR5 ligand which is bound to
the immobilized compound in step (e) with the amount
measured in step (d), wherein a reduced amount
measured in step (d) indicates that the agent binds to
the compound so as to thereby identify the agent as
one which inhibits binding of the CCR5 ligand to the
CCR5 receptor.

In one embodiment of the above method, the amount of the
detectable CCR5 ligand in step (a) and step (e) is
sufficient to saturate all binding sites for the CCR5
ligand on the compound.

This invention also provides a method of identifying an
agent which inhibits binding of a CCR5 ligand to a CCR5
receptor which comprises:

(a) contacting one of the compounds described herein with
the agent and detectable CCR5 ligand under conditions
permitting binding of the CCR5 ligand to the compound
so as to form a complex;

(b) removing any unbound CCR5 ligand;

(c) measuring the amount of detectable CCR5 ligand which
is bound to the compound in the complex;

(d) measuring the amount of detectable CCR5 ligand which
binds to the compound in the absence of the agent; and

(e) comparing the amount of CCR5 ligand which is bound to
the compound in step (c) with the amount measured in
step (d), wherein a reduced amount measured in step
(c) indicates that the agent binds to the compound so
as to thereby identify the agent as one which inhibits
binding of the CCR5 ligand to the CCR5 receptor.

In one embodiment of the above method, the amount of the 
detectable CCR5 ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

In one embodiment of the above method the solid support is 
a microtiter plate well. In another embodiment, the solid 
support is a bead. In a further embodiment, the solid 
support is a surface plasmon resonance sensor chip. The
surface plasmon resonance sensor chip can have pre-immobilized
streptavidin. In one embodiment, the surface
plasmon resonance sensor chip is a BIAcore™ chip.

In one embodiment of the above methods, the detectable CCR5 
ligand is labeled with a detectable marker. In another 
embodiment of the above methods, the CCR5 ligand is 
detected by contacting it with another compound which is
both capable of detecting the CCR5 ligand and is
detectable. The detectable markers include those described
above.

This invention provides a method of identifying an agent
which inhibits binding of a CCR5 ligand to a CCR5 receptor
which comprises:

a) immobilizing one of the compounds described
herein on a solid support;

b) contacting the immobilized compound from step a)
with the agent dissolved or suspended in a known
vehicle and measuring the binding signal
generated by such contact;

c) contacting the immobilized compound from step a)
with the known vehicle in the absence of the
agent and measuring the binding signal
generated by such contact;

d) comparing the binding signal measured in step b)
with the binding signal measured in step c),
wherein an increased amount measured in step b)
indicates that the agent binds to the compound so
as to thereby identify the agent as one which
binds to the CCR5 receptor.

In one embodiment of the above method, the solid support is
a surface plasmon resonance sensor chip. In another 
embodiment of the above method, the binding signal is 
measured by surface plasmon resonance. 
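
The vehicle-subtracted comparison of steps b) to d) can be sketched as follows. This is an illustrative Python example only: the replicate lists, the response units and the `min_net_ru` cutoff are assumptions introduced here and are not specified in the text.

```python
from statistics import mean


def net_response(agent_in_vehicle_ru, vehicle_only_ru):
    """Mean binding signal with the agent minus mean signal with vehicle alone.

    Both arguments are sequences of replicate readings, e.g. surface
    plasmon resonance response units (an assumed unit for illustration).
    """
    return mean(agent_in_vehicle_ru) - mean(vehicle_only_ru)


def agent_binds(agent_in_vehicle_ru, vehicle_only_ru, min_net_ru=10.0):
    """Step d): call the agent a binder when the vehicle-corrected signal
    exceeds `min_net_ru`, a hypothetical noise floor."""
    return net_response(agent_in_vehicle_ru, vehicle_only_ru) > min_net_ru
```

Subtracting the vehicle-only reading is what separates binding by the agent from bulk effects of the vehicle itself on the sensor surface.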



This invention provides a method of obtaining a composition
which comprises:

(a) identifying a compound which inhibits binding of a
CCR5 ligand to a CCR5 receptor according to one of the
above methods; and

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 

The invention provides agents identified in the screen. 
Such agents may have utility in treating HIV-1 infection or 
other CCR5-mediated diseases, which include rheumatoid
arthritis, asthma, multiple sclerosis, psoriasis, 
atherosclerosis and other inflammatory diseases. 

In one embodiment of the above methods, the CCR5 ligand is 
a complex comprising an HIV-1 envelope glycoprotein and a 
CD4-based protein. The HIV-1 envelope glycoproteins include
but are not limited to gp120, gp140 or gp160. The CD4-based
proteins include but are not limited to soluble CD4 or
CD4-IgG2.

As used herein, "CD4" means the mature, native,
membrane-bound CD4 protein comprising a cytoplasmic domain, a
hydrophobic transmembrane domain, and an extracellular
domain that binds to the HIV-1 gp120 envelope glycoprotein.
As used herein, "HIV-1 envelope glycoprotein" means the
HIV-1 encoded protein which comprises the gp120 surface
protein, the gp41 transmembrane protein and oligomers and
precursors thereof. As used herein, "CD4-based protein"
means any protein comprising at least one sequence of amino
acid residues corresponding to that portion of CD4 which is
required for CD4 to form a complex with the HIV-1 gp120
envelope glycoprotein. As used herein, "CD4-IgG2" means a
heterotetrameric CD4-human IgG2 fusion protein encoded by
the expression vectors deposited under ATCC Accession
Numbers 75193 and 75194.

In one embodiment of the above methods, the CCR5 ligand is
a chemokine. The chemokines include but are not limited to
RANTES, MIP-1α or MIP-1β. As used herein, "RANTES", "MIP-1α",
and "MIP-1β" denote members of the chemokine superfamily of
proteins that direct the activation and migration of
leukocytes and other cells involved in inflammation.
RANTES, MIP-1α and MIP-1β are known to bind CCR5 and induce
signaling. Their peptide sequences have been described
(Wells et al. J. Leukocyte Biology, 59:53-60, 1996).

In one embodiment of the above methods, the CCR5 ligand is
an antibody. In one embodiment, the antibody is PA8 (ATCC
Accession No. HB-12605). In another embodiment, the
antibody is PA10 (ATCC Accession No. HB-12607). In another
embodiment, the antibody is PA11 (ATCC Accession No.
HB-12608). In another embodiment, the antibody is PA12 (ATCC
Accession No. HB-12609).

This invention provides a compound having the structure:

A-(αYDINYYTSEβλ)n

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

This invention also provides a compound having the
structure:

(θαYDINYYTSEβ)n-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein θ
represents an amino group or an acetylated amino group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

This invention provides a compound having the structure:

A-(αYDINYYTSEβλ)n

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 333 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein λ
represents a carboxyl group or an amidated carboxyl group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

This invention also provides a compound having the
structure:

(θαYDINYYTSEβ)n-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine; wherein α represents from 0 to 9 amino acids,
with the proviso that if there are more than 2 amino acids,
they are joined together by peptide bonds in consecutive
order and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position 9
and extending therefrom in the amino terminal direction;
wherein β represents from 0 to 333 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction; wherein θ
represents an amino group or an acetylated amino group;
wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds, further provided that at least
two tyrosines in the compound are sulfated, wherein n is an
integer from 1 to 8, A is a polymer, and the solid line
represents up to 8 linkers which attach the structure in
parentheses to A.

The polymer of the above compounds includes but is not
limited to the following: a linear lysine polymer; a
branched lysine polymer; a linear arginine polymer; a
branched arginine polymer; polyethylene glycol (PEG); a
linear acetylated lysine polymer; a branched acetylated
lysine polymer; a linear chloroacetylated lysine polymer;
and a branched chloroacetylated lysine polymer.

The above compounds can be produced by various methods
known to those skilled in the art, including but not
limited to the following. Methods for producing synthetic
multimeric peptides such as multiple antigen peptides,
synthetic polymeric constructs, and branched lysine
oligopeptides are well known to those skilled in the art
(Spetzler and Tam, Int. J. Pept. Prot. Res. 45:78, 1995;
Yahi et al., J. Virol., 69:320, 1995; Okuda et al., J. Mol.
Recognit. 6:101, 1993). For example, radially branched
peptides can be produced by performing standard solid-phase
peptide synthesis methods using branched lysine skeletons
on 4-(oxymethyl)-phenylacetamidomethyl or other suitable
solid resin. Peptide chains are elongated in parallel in a
stepwise fashion using optimized t-butyloxycarbonyl/benzyl
chemistry as described (Sabatier et al., Biochemistry
32:2763, 1993). Peptides are liberated from the resin,
purified by reversed-phase chromatography over a C18 or
other suitable column and characterized by analytical HPLC
and mass spectroscopy. In another approach, monomeric
peptides are synthesized, purified, and then covalently
coupled to lysine copolymers using N-succinimidyl maleimido
carboxylate chemistry. In another approach, the peptides
can also be made in the form of affinity type multimers.
For example, peptides may be synthesized with an affinity
tag such as biotin. These affinity tagged peptides can then
be mixed with affinity ligands capable of binding
multimerically, such as streptavidin. Other site-specific
ligation chemistries are known to the skilled artisan.

This invention provides a compound comprising the
structure:

θαYDππYπππEβλ

wherein each E represents a glutamic acid, each D
represents an aspartic acid, and each Y represents a
tyrosine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined by peptide bonds in consecutive order and have a
sequence identical to the sequence set forth in SEQ ID NO:
1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined by peptide bonds in consecutive order and have a
sequence identical to the sequence set forth in SEQ ID NO:
1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein θ represents an amino group or an acetylated amino
group; wherein λ represents a carboxyl group or an amidated
carboxyl group;

wherein π represents any amino acid,

wherein all of α, Y, D, π, π, Y, π, π, π, E and β are joined
together by peptide bonds;

further provided that at least two tyrosines in the
compound are sulfated.

In one embodiment of this compound, the compound comprises
amino acids in addition to those in the YDππYπππE peptide,
and such amino acids correspond to those present in the
CCR5 receptor sequence set forth in SEQ ID NO: 1, yet an
amino acid may be replaced with a homologous amino acid.

The sequence YDππYπππE corresponds to amino acid residues
10-18 of the sequence set forth in SEQ ID NO: 1. For
example, if the peptide has one additional amino acid on
its N terminal end, then the sequence could be IYDππYπππE
or alternatively, the I could be replaced with G, A, V or
L.

In one embodiment of the above compound, the compound is a
peptide which comprises consecutive amino acids having the
sequence YDππYπππE.

In one embodiment of the above compound, the tyrosines at
positions 1 and 5 of the sequence YDππYπππE are sulfated.
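
The nine-residue motif described above (Y, D, two arbitrary residues, Y, three arbitrary residues, E) can be located in a one-letter peptide string with a regular expression. The following Python sketch is illustrative only; the function names are hypothetical, and it checks sequence pattern alone, not whether the tyrosines are actually sulfated.

```python
import re

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Lookahead so that overlapping occurrences would also be reported.
MOTIF = re.compile(
    r"(?=(YD[{aa}]{{2}}Y[{aa}]{{3}}E))".format(aa=AMINO_ACIDS)
)


def find_motif(sequence: str):
    """Return (1-based start position, matched 9-mer) for each motif hit."""
    return [(m.start(1) + 1, m.group(1))
            for m in MOTIF.finditer(sequence.upper())]


def sulfated_tyrosine_positions(motif_start: int):
    """Sequence positions of the tyrosines at motif positions 1 and 5."""
    return (motif_start, motif_start + 4)
```

Applied to a peptide carrying one extra N-terminal residue, such as IYDINYYTSEP, the motif starts at position 2, so the two tyrosines named in the embodiment sit at positions 2 and 6 of that peptide.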

As used herein, "homologous amino acids" are those which
have chemically similar side chains. For example, aliphatic
side chains (G, A, V, L and I); aromatic side chains (F, Y
and W); basic side chains (K, R and H); acidic side chains
(D and E); amide side chains (N and Q); aliphatic
hydroxyl-containing side chains (S and T); sulfur-containing
side chains (C and M). Homology between amino acids may also
be drawn on other bases, such as size, polarity, hydrogen
bonding potential, hydrophilicity and hydrophobicity.
Proline differs from the above amino acids in that it
contains a secondary rather than primary amino group.
Accordingly, proline may be considered an imino acid.
Substitution of proline with another amino acid (e.g. G, A
or S) can increase the flexibility of a peptide.
Conversely, substitution of another amino acid with a
proline can stabilize a desired conformation.
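
The side-chain classes listed above translate directly into a lookup table. The Python sketch below encodes exactly those seven groups; the dictionary keys and function names are labels chosen here for illustration. Proline is deliberately left unclassified, consistent with the text treating it separately.

```python
# Side-chain classes as listed in the definition of "homologous amino acids".
HOMOLOGY_GROUPS = {
    "aliphatic": "GAVLI",
    "aromatic": "FYW",
    "basic": "KRH",
    "acidic": "DE",
    "amide": "NQ",
    "aliphatic hydroxyl": "ST",
    "sulfur-containing": "CM",
}


def side_chain_class(residue: str):
    """Return the class name for a one-letter residue, or None (e.g. proline)."""
    residue = residue.upper()
    for name, members in HOMOLOGY_GROUPS.items():
        if residue in members:
            return name
    return None


def are_homologous(a: str, b: str) -> bool:
    """True when both residues fall in the same listed side-chain class."""
    cls = side_chain_class(a)
    return cls is not None and cls == side_chain_class(b)
```

This covers only the side-chain grouping; the other bases for homology mentioned in the text (size, polarity, hydrogen bonding potential) would need their own tables.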

This invention provides a compound comprising the
structure:

θαYDINYYTSEβλ

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined by peptide bonds in consecutive order and have a
sequence identical to the sequence set forth in SEQ ID NO:
1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined by peptide bonds in consecutive order and have a
sequence identical to the sequence set forth in SEQ ID NO:
1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein θ represents an amino group or an acetylated amino
group; wherein λ represents a carboxyl group or an amidated
carboxyl group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds;

further provided that at least two tyrosines in the
compound are sulfated,

wherein any amino acid except for the Y at position 1, D at
position 2, Y at position 5 and E at position 9 may be
replaced with a homologous amino acid.
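
To make the roles of the two flanking extensions concrete, the Python sketch below builds the peptide backbone for a chosen number of N-terminal and C-terminal extension residues. It rests on a loudly stated assumption: the string `CCR5_NTERM` is taken here to be residues 1-31 of the CCR5 sequence and should be checked against SEQ ID NO: 1 before any use. It agrees with the text in placing I at position 9, the core YDINYYTSE at positions 10-18, and P at position 19. The function name is hypothetical, and terminal groups and sulfation are not modeled.

```python
# ASSUMPTION: residues 1-31 of CCR5; verify against SEQ ID NO: 1.
CCR5_NTERM = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"

CORE_START, CORE_END = 10, 18  # 1-based positions of YDINYYTSE


def build_peptide(n_alpha: int, n_beta: int) -> str:
    """Core 9-mer plus n_alpha native residues on the amino side
    (counted back from I9) and n_beta on the carboxy side (from P19)."""
    if not 0 <= n_alpha <= 9:
        raise ValueError("n_alpha must be between 0 and 9")
    if not 0 <= n_beta <= 13:
        raise ValueError("n_beta must be between 0 and 13")
    start = CORE_START - n_alpha  # 1-based, inclusive
    end = CORE_END + n_beta       # 1-based, inclusive
    return CCR5_NTERM[start - 1:end]
```

Only native-sequence extensions are generated; homologous replacements permitted by the text at positions other than Y1, D2, Y5 and E9 of the core would be applied as a separate step.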

In one embodiment of the above compound, with respect to
replacing homologous amino acids, any I amino acid residue
may be replaced with a G, A, V or L amino acid residue. In
one embodiment of the above compound, any N amino acid
residue may be replaced with a Q amino acid residue. In one
embodiment of the above compound, any Y amino acid residue
may be replaced with a F or W amino acid residue. In one
embodiment of the above compound, any T amino acid residue
may be replaced with a S amino acid residue. In one
embodiment of the above compound, any S amino acid residue
may be replaced with a T amino acid residue. In one
embodiment of the above compound, any C may be replaced
with M, S, T, A, G, N, or Q.

In one embodiment, a C amino acid residue within the β
region of the compound may be replaced with any other amino
acid.

This invention provides an agent which binds to an epitope
of HIV-1 gp120, which epitope comprises amino acid residues
R298, N301, T303, I322, D324, I325, R326, I420, K421, Q422,
W427, thereby inhibiting binding of HIV-1 gp120 to a CCR5
chemokine receptor.

The above amino acid numbering is per HIV-1 strain HxB2
(Genbank Accession No. AAB50262). Amino acids D324, I325
and R326 are derived from HIV-1 strain JR-FL (Genbank
Accession No. AAB05604).

The amino acid sequence (SEQ ID NO: 17) for HIV-1 HxB2 gp120
is set forth below:

  1 MRVKEKYQHL WRWGWRWGTM LLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA
 61 YDTEVKWVWA THACVPTDPN PQEWLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV
121 KLTPLCVSLK CTDLKNDTNT NSSSGRMIME KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD
181 IIPIDNDTTS YKLTSCNTSV ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT
241 NVSTVQCTHG IRPWSTQLL LNGSLAEEEV VIRSVWFTDN AKTIIVQIjNT SVEINCTRPN
301 NNTRICRIRIQ RGPGRAFVTI GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQFGNNKTII
361 FKQSSGGDPE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTW STEGSRWTEG SDTITLPCRI
421 KQIINMWQKV GKAMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR
481 SELYKYKWK IEPLGVAPTK AKRRWQREK R

The amino acid sequence (SEQ ID NO: 16) for HIV-1 JR-FL
gp120 is set forth below:

  1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG VPVWKEATTT LFCASDAKAY
 61 DTEVHNVWAT HACVPTDPNP QEWLEKFVTE HFNMWKKTWNV EQMQEDIISI. WDQSLKPCVK
121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE HOSTCSFNITT SIRDEVQKEY ALFYKLDWP
181 IDNlNTNTSYRL ISCDTSVITQ ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS
241 TVQCTHGIRP WSTQLLLNG SLAEEEWIR SDNFTNNAKT IIVQLKESVE INCTRPNNNT
301 RKSIHIGPGR AFYTTGEIIG DIRQAKCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS
361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NIvTInTTEGSNNT EGNTITLPCR IKQIINMWQE
421 VGKAMYAPPI RGQIRCSSNI TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKW
481 KIEPLGVAPT KAKRRWQRE KR

This invention provides the above agent, wherein the
epitope is altered or masked by an alanine substitution of
at least one of the amino acid residues R298, N301, T303,
I322, D324, I325, R326, I420, K421, Q422 and W427.

This invention provides an agent which binds to an epitope
of HIV-1 gp120, which epitope comprises amino acid residues
R298, N301, T303, I322, D324, I325, R326, I420, K421, Q422,
W427, thereby inhibiting HIV-1 infection of a CD4+ CCR5+
cell.

This invention provides the above agent, wherein the
epitope is altered or masked by an alanine substitution of
at least one of the amino acid residues R298, N301, T303,
I322, D324, I325, R326, I420, K421, Q422 and W427.

In one embodiment of any of the above agents, the agent is
a peptide. In one embodiment of any of the above agents,
the peptide comprises consecutive amino acids having the
sequence YDINYYTSE. In one embodiment at least two
tyrosines in the compound are sulfated. In one embodiment,
the tyrosines at positions 1 and 5 of the sequence
YDINYYTSE are sulfated.

In one embodiment of any of the above agents, the agent is
an antibody or portion of an antibody. In one embodiment of
any of the above agents, the agent is a nonpeptidyl agent.
In one embodiment of any of the above agents, the agent is
a peptidyl agent.

This invention provides a method of inhibiting HIV-1
infection of a CD4+ CCR5+ cell which comprises contacting
the cell with an amount of an agent of the subject
invention effective to bind to HIV-1 gp120, so as to
thereby inhibit HIV-1 infection of the CD4+ CCR5+ cell.

This invention provides a compound having one of the
following structures:

A-(αYDINYYTSEβλ), (θαYDINYYTSEβ)-A, or A-(αYDINYYTSEβ)-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine; each D represents an aspartic acid,
each I represents an isoleucine; and each N represents an
asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein λ represents a carboxyl group or an amidated
carboxyl group;

wherein θ represents an amino group or an acetylated amino
group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds,

further provided that at least two tyrosines in the
compound are sulfated,

wherein A is a molecule that self-oligomerizes, and the
solid line represents a peptide linker or a peptide,
disulfide, or other chemical bond.

As used herein "peptide linker" is a peptide comprising 
consecutive amino acids having a sequence which includes 
but is not limited to GAG, SGGRGG and QSTRGGASGGG or 
repeating units thereof. One skilled in the art would know 
other flexible peptide linkers. 

In one embodiment of the above compound, the peptide that
self-oligomerizes contains alpha-helical regions capable of
forming coiled coils.

The α-helical coiled coil (48) is probably the most
widespread subunit oligomerization motif found in proteins
(48-52). It is a type of protein structure consisting of
two to five amphipathic α-helices that "coil" around each
other in a left-handed supertwist (48-52). The sequences of
coiled coils are characterized by a heptad repeat of seven
residues with a hydrophobic repeat of mostly apolar amino
acids.

In one embodiment of the above compound, the peptide that
self-oligomerizes is a peptide having a sequence of at
least a portion of an HIV-1 gp41 heptad repeat sequence 1.
In one embodiment, the HIV-1 gp41 heptad repeat sequence 1 is
RQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAYERYLKDQ (SEQ ID
NO:3).

In one embodiment of the above compound, the peptide that
self-oligomerizes is a peptide having a sequence of at
least a portion of an HIV-1 gp41 heptad repeat sequence 2.
In one embodiment, the HIV-1 gp41 heptad repeat sequence 2
is WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE (SEQ ID NO: 4).

In one embodiment of the above compound, the peptide that
self-oligomerizes is a peptide having a sequence
corresponding to at least a portion of the leucine zipper
region of transcription factor GCN4. In one embodiment,
the sequence of the leucine zipper region of transcription
factor GCN4 is HMKQLEDKVEELLSKNYHLENEVARLKKLVGER (SEQ ID
NO: 6).
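
The heptad periodicity that characterizes a leucine zipper can be seen by grouping leucine positions by their remainder modulo 7. The Python sketch below uses the GCN4 sequence exactly as given above; the function names are hypothetical, and this simple register count is an illustration, not a coiled-coil prediction method.

```python
# Leucine zipper region of GCN4 as given in the text (SEQ ID NO: 6).
GCN4_ZIPPER = "HMKQLEDKVEELLSKNYHLENEVARLKKLVGER"


def leucine_frames(sequence: str):
    """Group 1-based leucine positions by heptad register (position mod 7)."""
    frames = {}
    for position, residue in enumerate(sequence.upper(), start=1):
        if residue == "L":
            frames.setdefault(position % 7, []).append(position)
    return frames


def best_leucine_ladder(sequence: str):
    """Return the leucine positions in the most populated heptad register."""
    frames = leucine_frames(sequence)
    return max(frames.values(), key=len) if frames else []
```

For this sequence the leucines at positions 5, 12, 19 and 26 fall in one register, spaced exactly seven residues apart, which is the repeating pattern the heptad description refers to.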


In one embodiment of the above compound, the peptide that
self-oligomerizes is a peptide having a sequence
corresponding to at least a portion of the leucine zipper
region of transcription factor GCN4. In one embodiment, the
sequence is derived from the leucine zipper region of
transcription factor GCN4. In one embodiment, the sequence
forms trimeric coiled-coils. In one embodiment, the
sequence is KMKQIEDKIEEILSKIYHIENEIARIKKLIGEV (SEQ ID
NO: 7).

In one embodiment of the above compound, the peptide that
self-oligomerizes is a peptide having a sequence
corresponding to at least a portion of a leucine zipper
region of a human protein. The human protein includes but
is not limited to transcription activator c-fos,
transcription activator c-jun, enzyme quiescent cell
proline dipeptidase, macrophage scavenger receptor,
salivary mucin (MUC7), or human quiescent cell proline
dipeptidase (QPP). In one embodiment, the human protein is
QPP and the leucine zipper region has the sequence
LLTVEQALADFAELLRALRRDL (SEQ ID NO: 5). In one embodiment,
the transcription activator is c-fos and the leucine zipper
region has the sequence
LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAR (SEQ ID NO: 8). In
one embodiment, the transcription activator is c-jun and
the leucine zipper region has the sequence
HMRRIARLEEKVKTLKAQNSELASTAJWLREQVAQLKQKY (SEQ ID NO: 9).

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to that of at least a portion of an antibody. 
In one embodiment, the portion of the antibody comprises 
the heavy chain. In one embodiment, the portion of the 
antibody heavy chain comprises the heavy chain constant 
region. In one embodiment, the portion of the antibody 
heavy chain comprises the hinge and Fc domains. In one
embodiment, the portion of the antibody heavy chain
comprises the Fc domain. In one embodiment, the portion of 
the antibody comprises the light chain. In one embodiment, 
the portion comprises the light chain constant region. In 
one embodiment, the portion of the antibody comprises the 
heavy and light chains. 

This invention provides a compound having one of the 
following structures: 

A-(αYDINYYTSEβX), (θαYDINYYTSEβ)-A, or A-(αYDINYYTSEβ)-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein X represents a carboxyl group or an amidated
carboxyl group;

wherein θ represents an amino group or an acetylated amino
group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds,

further provided that at least two tyrosines in the
compound are sulfated,

wherein A is a toxin, and the solid line represents a peptide
linker or a peptide, disulfide, or other chemical bond.
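
By way of illustration only, the peptide portion of the above structures can be assembled programmatically. The following Python sketch covers only the case in which the flanking segments are taken from the receptor sequence. It assumes, as a stand-in for SEQ ID NO: 1 (which is not reproduced in this section), the first 31 residues of the human CCR5 amino terminus; the names CCR5_NT, CORE and build_peptide are hypothetical and are not part of this disclosure.

```python
# ASSUMPTION: the first 31 residues of human CCR5 stand in for SEQ ID NO: 1.
CCR5_NT = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"
CORE = "YDINYYTSE"  # residues 10-18, the invariant core of the structures


def build_peptide(alpha_len: int, beta_len: int) -> str:
    """Return the amino-terminal flank + YDINYYTSE + carboxy-terminal flank.

    alpha_len residues end at the I at position 9 and extend toward the
    amino terminus; beta_len residues begin at the P at position 19 and
    extend toward the carboxy terminus.
    """
    if not 0 <= alpha_len <= 9:
        raise ValueError("the amino-terminal flank must have 0 to 9 residues")
    if not 0 <= beta_len <= 13:
        raise ValueError("the carboxy-terminal flank must have 0 to 13 residues")
    if CCR5_NT[9:18] != CORE:
        raise ValueError("the assumed sequence does not contain the core at 10-18")
    alpha = CCR5_NT[9 - alpha_len:9]    # positions (10 - alpha_len) through 9
    beta = CCR5_NT[18:18 + beta_len]    # positions 19 through (18 + beta_len)
    return alpha + CORE + beta


# A peptide spanning residues 2-18 (eight flanking residues on the amino side)
print(build_peptide(alpha_len=8, beta_len=0))
```

The sketch does not model tyrosine sulfation, the terminal groups X and θ, or the moiety A; these would be handled at the synthesis stage.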

In one embodiment of the above compound, the toxin is a
radionuclide. In one embodiment, the radionuclide is an
alpha-emitting isotope. The alpha-emitting isotope includes
but is not limited to ²²⁵Ac, ²¹¹At, ²¹²Bi, or ²¹³Bi. In one
embodiment, the radionuclide is a beta-emitting isotope.
The beta-emitting isotope includes but is not limited to
¹⁸⁶Rh, ¹⁸⁸Rh, ⁹⁰Y, ¹³¹I, or ⁶⁷Cu. In one embodiment, the
radionuclide emits Auger and low-energy electrons.
The radionuclide includes but is not limited to ¹²⁵I, ¹²³I, or
⁷⁷Br.



In one embodiment of the above compound, the toxin is a 
chemical toxin. The chemical toxin may be a peptidyl 
chemical toxin. The peptidyl chemical toxin includes but is 
not limited to ricin. The chemical toxin may be a 
nonpeptidyl chemical toxin. The nonpeptidyl chemical toxin 
includes but is not limited to calicheamicin.

This invention provides a compound having one of the 
following structures: 

A-(αYDINYYTSEβX), (θαYDINYYTSEβ)-A, or A-(αYDINYYTSEβ)-A,

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein X represents a carboxyl group or an amidated
carboxyl group;

wherein θ represents an amino group or an acetylated amino
group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds,

further provided that at least two tyrosines in the
compound are sulfated,

wherein A is a molecule with anti-HIV activity, and the solid
line represents a peptide linker or a peptide, disulfide,
or other chemical bond.

In one embodiment of the above compound, the molecule with
anti-HIV activity is a CD4-immunoglobulin fusion protein.
In one embodiment, the CD4-immunoglobulin fusion protein is
CD4-IgG2, wherein the CD4-IgG2 comprises two heavy chains
and two light chains, wherein the heavy chains are encoded
by an expression vector designated CD4-IgG2HC-pRcCMV (ATCC
Accession No. 75193) and the light chains are encoded by an
expression vector designated CD4-kLC-pRcCMV (ATCC Accession
No. 75194).

In one embodiment of the above compound, the molecule with
anti-HIV activity is a compound which retards gp41 from
adopting a conformation capable of mediating fusion of
HIV-1 to a CD4+ cell by binding noncovalently to an epitope
on a gp41 fusion intermediate. In one embodiment, the compound
comprises a peptide selected from the group consisting of
T-20 (SEQ ID NO: 10), DP107 (SEQ ID NO: 11), N34 (SEQ ID
NO: 12), C28 (SEQ ID NO: 13), N34(L6)C28 (SEQ ID NO: 14),
and T1249 (SEQ ID NO: 15).

As used herein, "T-20" and "DP178" are used interchangeably 
to denote a peptide having the following amino acid 
sequence: YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF (SEQ ID
NO: 10), as described (53, 54).

DP107 has the following amino acid sequence: 
NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ (SEQ ID NO: 11) 

N34 has the following amino acid sequence: 
SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR (SEQ ID NO: 12) 

C28 has the following amino acid sequence:
WMEWDREINNYTSLIHSLIEESQNQQEK (SEQ ID NO: 13)

N34(L6)C28 has the following amino acid sequence:
SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARSGGRGGWMEWDREINNYTSLIHSLI
EESQNQQEK (SEQ ID NO: 14)

T1249 has the following amino acid sequence:
WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF (SEQ ID NO: 15)

This invention provides the above compound wherein the 
molecule with anti-HIV activity is a CCR5 chemokine 
receptor targeting agent. 



In one embodiment, the CCR5 chemokine receptor targeting
agent is an antibody or portion of an antibody. In one
embodiment, the antibody includes but is not limited to PA8
(ATCC Accession No. HB-12605), PA10 (ATCC Accession
No. HB-12607), PA11 (ATCC Accession No. HB-12608), PA12 (ATCC
Accession No. HB-12609), and PA14 (ATCC Accession No.
HB-12610). In one embodiment, the antibody is PA14 (ATCC
Accession No. HB-12610).

The antibody may be a monoclonal antibody or polyclonal
antibody. The monoclonal antibody may be a human, humanized
or chimeric antibody. This invention provides humanized
forms of the above antibodies.

As used herein, "humanized" describes antibodies wherein
some, most or all of the amino acids outside the CDR
regions are replaced with corresponding amino acids derived
from human immunoglobulin molecules. In one embodiment of
the humanized forms of the antibodies, some, most or all of
the amino acids outside the CDR regions have been replaced
with amino acids from human immunoglobulin molecules but
where some, most or all amino acids within one or more CDR
regions are unchanged. Small additions, deletions,
insertions, substitutions or modifications of amino acids
are permissible as long as they would not abrogate the
ability of the antibody to bind a given antigen. Suitable
human immunoglobulin molecules would include IgG1, IgG2,
IgG3, IgG4, IgA and IgM molecules. A "humanized" antibody
would retain a similar antigenic specificity as the
original antibody, i.e., in the present invention, the
ability to bind CCR5.



One skilled in the art would know how to make the humanized 
antibodies of the subject invention. Various publications, 
several of which are hereby incorporated by reference into
this application, also describe how to make humanized 
antibodies. For example, the methods described in United 
States Patent No. 4,816,567 (55) comprise the production of 
chimeric antibodies having a variable region of one 
antibody and a constant region of another antibody. 

United States Patent No. 5,225,539 (56) describes another 
approach for the production of a humanized antibody. This 
patent describes the use of recombinant DNA technology to 
produce a humanized antibody wherein the CDRs of a variable 
region of one immunoglobulin are replaced with the CDRs
from an immunoglobulin with a different specificity such
that the humanized antibody would recognize the desired
target but would not be recognized in a significant way by
the human subject's immune system. Specifically,
site-directed mutagenesis is used to graft the CDRs onto the
framework. 

Other approaches for humanizing an antibody are described
in United States Patent Nos. 5,585,089 (57) and 5,693,761
(58) and WO 90/07861, which describe methods for producing
humanized immunoglobulins. These have one or more CDRs and
possibly additional amino acids from a donor immunoglobulin
and a framework region from an accepting human
immunoglobulin. These patents describe a method to increase
the affinity of an antibody for the desired antigen. Some
amino acids in the framework are chosen to be the same as
the amino acids at those positions in the donor rather than
in the acceptor. Specifically, these patents describe the
preparation of a humanized antibody that binds to a
receptor by combining the CDRs of a mouse monoclonal
antibody with human immunoglobulin framework and constant
regions. Human framework regions can be chosen to maximize
homology with the mouse sequence. A computer model can be
used to identify amino acids in the framework region which
are likely to interact with the CDRs or the specific
antigen and then mouse amino acids can be used at these
positions to create the humanized antibody.

The above patents 5,585,089 and 5,693,761, and WO 90/07861
(59) also propose four possible criteria which may be used in
designing the humanized antibodies. The first proposal was
that for an acceptor, use a framework from a particular
human immunoglobulin that is unusually homologous to the
donor immunoglobulin to be humanized, or use a consensus
framework from many human antibodies. The second proposal
was that if an amino acid in the framework of the human
immunoglobulin is unusual and the donor amino acid at that
position is typical for human sequences, then the donor
amino acid rather than the acceptor may be selected. The
third proposal was that in the positions immediately
adjacent to the 3 CDRs in the humanized immunoglobulin
chain, the donor amino acid rather than the acceptor amino
acid may be selected. The fourth proposal was to use the
donor amino acid residue at the framework positions at which
the amino acid is predicted to have a side chain atom
within 3 Å of the CDRs in a three dimensional model of the
antibody and is predicted to be capable of interacting with
the CDRs. The above methods are merely illustrative of some
of the methods that one skilled in the art could employ to
make humanized antibodies.



This invention provides the above compound, wherein the 
portion of the antibody is a Fab fragment of the antibody. 
This invention provides the above compound, wherein the 
portion of the antibody comprises the variable domain of 
the antibody. This invention provides the above compound, 
wherein the portion of the antibody comprises a 
complementarity determining region or CDR portion of the 
antibody. The monoclonal antibody includes but is not 
limited to an IgG, IgM, IgD, IgA, or IgE monoclonal 
antibody. 

This invention provides the above compound, wherein the
molecule with anti-HIV activity is a chemokine or chemokine
derivative. The chemokine includes but is not limited to
RANTES, MIP-1α, MIP-1β, SDF-1 or other chemokine which
blocks HIV-1 infection. The chemokine derivative includes
but is not limited to Met-RANTES, AOP-RANTES, RANTES 9-68,
or NNY-RANTES.

The molecule may also be a non-chemokine agent capable of
binding to chemokine receptors and inhibiting fusion of
HIV-1 to CD4+ cells. The non-chemokine agents include, but
are not limited to, chemokine fragments and chemokine
derivatives and analogues. In one embodiment, the agent
does not include naturally occurring chemokines. The
non-chemokine agents include multimeric forms of the chemokine
fragments and chemokine derivatives and analogues or fusion
molecules which contain chemokine fragments, derivatives
and analogues linked to other molecules. In one
embodiment, the non-chemokine agents do not include
bicyclams and their derivatives as described in U.S. Patent
No. 5,021,409, issued June 4, 1991, the content of which is
incorporated by reference into this application. Some
bicyclam derivatives have been previously described with
antiviral activities (60, 61).

In an embodiment of this invention, the non-chemokine agent
is an oligopeptide. In another embodiment, the
non-chemokine agent is a polypeptide. In still another
embodiment, the non-chemokine agent is an antibody or a
portion thereof. Antibodies against the chemokine receptor
may easily be generated by routine experiments. It is also
within the level of ordinary skill to synthesize fragments
of the antibody capable of binding to the chemokine
receptor. In a further embodiment, the non-chemokine agent
is a nonpeptidyl agent such as TAK-779 (64) or AMD3100
(65).

Non-chemokine agents which are purely peptidyl in
composition can be either chemically synthesized by
solid-phase methods (62) or produced using recombinant technology
in either prokaryotic or eukaryotic systems. The synthetic
and recombinant methods are well known in the art.

Non-chemokine agents which contain biotin or other
nonpeptidyl groups can be prepared by chemical modification
of synthetic or recombinant chemokines or non-chemokine
agents. One chemical modification method involves
periodate oxidation of the 2-amino alcohol present on
chemokines or non-chemokine agents possessing serine or
threonine as their N-terminal amino acid (63). The
resulting aldehyde group can be used to link peptidyl or
non-peptidyl groups to the oxidized chemokine or
non-chemokine agent by reductive amination, hydrazine, or other
chemistries well known to those skilled in the art.

This invention provides a compound having one of the 
following structures: 

A-(αYDINYYTSEβX), (θαYDINYYTSEβ)-A, or A-(αYDINYYTSEβ)-A

wherein each T represents a threonine, each S represents a
serine, each E represents a glutamic acid, each Y
represents a tyrosine, each D represents an aspartic acid,
each I represents an isoleucine, and each N represents an
asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the I at position 9 and extending
therefrom in the amino terminal direction;

wherein β represents from 0 to 13 amino acids, with the
proviso that if there are more than 2 amino acids, they are
joined together by peptide bonds in consecutive order and
have a sequence identical to the sequence set forth in SEQ
ID NO: 1 beginning with the P at position 19 and extending
therefrom in the carboxy terminal direction;

wherein X represents a carboxyl group or an amidated
carboxyl group;

wherein θ represents an amino group or an acetylated amino
group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds,

further provided that at least two tyrosines in the
compound are sulfated,

wherein A is a peptidyl or nonpeptidyl agent, and the solid
line represents a peptide linker, or a peptide, disulfide,
or other chemical bond.

In one embodiment, A is a nonpeptidyl agent, and the
nonpeptidyl agent is polyethylene glycol.

This invention will be better understood from the
Experimental Details that follow. However, one skilled in
the art will readily appreciate that the specific methods
and results discussed are merely illustrative of the
invention as described more fully in the claims that follow
thereafter.

EXPERIMENTAL DETAILS

First Series of Experiments
A. Materials

Purified recombinant CD4-IgG2 protein was produced by
Progenics Pharmaceuticals, Inc. from plasmids
CD4-IgG2-HC-pRcCMV and CD4-kLC-pRcCMV as described (Allaway et al. AIDS
Res. Hum. Retroviruses 11:533, 1995). Soluble CD4 is
commercially available (NEN Life Science Products, Boston,
MA). Anti-CCR5 MAb 2D7 was purchased from Pharmingen (San
Diego, CA).

The plasmids designated PPI4-tPA-gp120JR-FL-V3 and
PPI4-tPA-gp120DH123 were prepared as described (Hasel et al. US
Patents 5,869,624 and 5,886,163). Monomeric gp120
glycoproteins were produced in CHO cells stably transfected
with the PPI4-tPA-gp120 plasmids and purified to
homogeneity as described (Hasel et al. US Patents 5,869,624
and 5,886,163; Trkola et al. Nature 384:184, 1996). The
antibodies designated PA8, PA10, PA12 and PA14 were
prepared by growing the corresponding hybridoma cell line
in mouse ascites and isolating the antibody using protein A
affinity chromatography as described (Olson et al. J. Virol.
73:4145, 1999). L1.2-CCR5+ cells were cultured as described
(Olson et al. J. Virol. 73:4145, 1999).

Peptides containing different segments of the CCR5 Nt were
custom-synthesized by solid-phase fluorenylmethoxycarbonyl
chemistry using phospho- and sulfo-tyrosine precursors as
building blocks where indicated (Figure 6). Biotinylated
versions of peptides S-10/14 and P-10/14 incorporated a
C-terminal GAG spacer preceding a biotinylated lysine.
Following cleavage from the resin, peptides were purified
by reverse-phase chromatography on C18 columns (Vydac,
Hesperia, CA) and analyzed by HPLC and mass spectroscopy.
Figure 6 describes the different peptides that were used in
this study.

Binding of gp120 to CCR5

A gp120/CD4 complex formed from monomeric gp120 (100 nM) and
biotinylated CD4-IgG2 (50 nM) was added to 1x10^ L1.2-CCR5+
cells in the presence of different concentrations of
peptide (Olson et al. J. Virol. 73:4145, 1999). CD4-IgG2 is
tetrameric and therefore binds four molecules of gp120,
which increases binding of the complex to CCR5 (Allaway et
al. AIDS Res. Hum. Retroviruses 11:533, 1995). The mean
fluorescence intensity (m.f.i.) was measured by flow
cytometry after addition of phycoerythrin (PE)-labeled
streptavidin (Becton Dickinson, San Jose, CA). Inhibition of
gp120/CCR5 binding was calculated: (m.f.i. with
peptide)/(m.f.i. without peptide) x 100%.
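
The calculation above can be sketched in a few lines of Python. This is an illustration only: the function name percent_of_control and the m.f.i. readings are hypothetical and are not taken from the experiments described herein. Note that the ratio as written expresses the signal remaining in the presence of peptide as a percentage of the no-peptide control, so the reduction in binding is 100% minus that value.

```python
def percent_of_control(signal_with_peptide: float,
                       signal_without_peptide: float) -> float:
    """Signal with peptide expressed as a percentage of the no-peptide control."""
    if signal_without_peptide <= 0:
        raise ValueError("the control signal must be positive")
    return 100.0 * signal_with_peptide / signal_without_peptide


# Hypothetical m.f.i. readings, for illustration only
mfi_without_peptide = 200.0
mfi_with_peptide = 50.0

binding = percent_of_control(mfi_with_peptide, mfi_without_peptide)
print(f"binding relative to control: {binding:.1f}%")
print(f"reduction in binding: {100.0 - binding:.1f}%")
```

The same ratio can be applied to any pair of readings taken with and without peptide, for example relative light units in a luciferase-based assay.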

It was first tested whether tyrosine-sulfated peptides
spanning amino acids 2-18 of the CCR5 Nt could inhibit
binding of the gp120JR-FL/CD4-IgG2 complex to CCR5+ cells. The
HIV-1JR-FL isolate exclusively uses CCR5 as a co-receptor
(Dragic et al. Nature 381:667, 1996). Only peptides
S-3/10/14 and S-10/14 inhibited complex binding to the cells
in a dose-dependent manner (Fig. 1a). Peptides S-10 and
S-14 had no inhibitory activity, even at the highest
concentrations (Fig. 1a). Peptide TS-10/14, spanning amino
acids 10-14, did not inhibit gp120JR-FL/CD4-IgG2 binding to
CCR5+ cells, despite the presence of two sulfo-tyrosine
residues (Fig. 1b).

Tyrosine-phosphorylated peptides P-10/14 and P-3/10/14 did
not inhibit gp120JR-FL/CD4-IgG2 binding to CCR5+ cells (Fig.
1b). As further specificity controls we synthesized
peptides containing the first seventeen residues of the
CCR5 Nt in random order with sulfo-tyrosines in positions
10 and 14 (SS-10/14) or in positions 2 and 12 (SS-2/12).
Neither of these peptides reduced gp120JR-FL/CD4-IgG2
binding to CCR5+ cells, even at the highest concentrations
(Fig. 1b).

Surface plasmon resonance measurements (BIAcore)

Streptavidin-coated sensor chips (BIAcore AB, Sweden) were
conditioned with five injections of regeneration solution
(1 M NaCl, 50 mM NaOH) and equilibrated with HBS-EP buffer
(10 mM HEPES, 150 mM NaCl, 3 mM EDTA, 0.005% polysorbate 20) as
recommended by the manufacturer. Biotinylated peptides were
then immobilized on the chip by injection of peptide
(100 nM) in HBS-EP buffer, followed by an injection of
regeneration solution and equilibration with HBS-EP buffer.
400 resonance units (RU) of peptide were bound to the
sensor chip surface. Solutions of the following proteins
(100 nM) were passed over the sensor chip surface: gp120,
sCD4, gp120/sCD4, PA8, PA10 and 2D7. Surface plasmon
resonance was monitored and displayed in arbitrary
resonance units (RU) as a function of time. Following
injection of each solution the chip was regenerated and
equilibrated as described above.

Biotinylated peptide was attached to the streptavidin- 
coated gold surface of a sensor chip and solutions 
containing different gpl20/sCD4 complexes were flowed over 
the immobilized peptide. Adsorption of the complex due to 
complex/peptide binding was detected by an increase in 
surface plasmon resonance signal (RU) , which reports 
changes in the effective refractive index very near the
gold surface of the sensor chip (Schuck, Ann. Rev. Biophys.
Biomol. Struct. 26:541, 1997). For proteins of similar size,
such as the different gpl20/sCD4 complexes, RU plateau 
values are directly proportional to the amount of protein 
bound to the peptide. 

Specific association of the gp120JR-FL/sCD4 complex with the
sulfo-tyrosine-containing peptide bS-10/14 was accompanied
by a significant increase in RU (Fig. 2a). The signal
plateau but not the shape of the sensorgrams varied with
gp120JR-FL/sCD4 concentration, indicating that the
peptide/complex interaction was dose-dependent (data not
shown). The sensorgram obtained with bP-10/14 is similar
to the one obtained in the absence of peptide, indicating a
complete lack of association of the phosphorylated peptide
with the protein complex (Fig. 2a). Neither gp120JR-FL nor
sCD4 alone produced a significant increase in RU,
indicating that they did not associate with the immobilized
peptides (Fig. 2b, c). The gp120-ΔV3JR-FL/sCD4 complex was
also unable to associate with the peptides (Fig. 2d).

To further ascertain the specificity of the peptide/complex
association we performed BIAcore analyses using envelope
glycoproteins from HIV-1DH123, an R5X4 isolate, and HIV-1LAI,
an X4 isolate (5). gp120DH123/sCD4 associated specifically
with the sulfated peptide, although the plateau RU values
were lower than those observed with gp120JR-FL/sCD4 (Fig.
2e). We did not detect any binding of gp120DH123/sCD4 to the
phosphorylated peptide (Fig. 2e), nor did gp120DH123 alone
associate with the peptides (Fig. 2f). Finally, gp120LAI with
or without sCD4 was not able to associate with either one
of the peptides (Fig. 2g, h).

These methods could be readily modified to screen for
agents that bind CCR5 or that block its interaction with
antibodies, gp120 or other ligands. For example, direct
binding of the agents could be analyzed as described above,
where the agent is substituted for the anti-CCR5 antibody
or gp120/sCD4 complex. In another embodiment, the agent
could be mixed or pre-incubated with the anti-CCR5 antibody
(or gp120/sCD4 complex) prior to passing the mixture over
biosensor chips as described above.

Binding of MAbs to CCR5

L1.2-CCR5+ cells (1x10^) were incubated with anti-CCR5 MAb
(50 nM) ± peptide (100 µM). MAb binding was detected using a
PE-labeled goat anti-mouse antibody (Caltag Laboratories,
Burlingame, CA). The m.f.i. value was measured by flow
cytometry as described (Olson et al. J. Virol. 73:4145,
1999). MAb binding was calculated as above.

We determined whether the CCR5 Nt peptides could inhibit
binding of a panel of anti-CCR5 MAbs to CCR5+ cells. PA8
binding was reduced significantly by all wild-type peptides
containing amino acids 2-18, regardless of tyrosine
modification (Fig. 3). BIAcore analysis confirmed that PA8
similarly and specifically associated with both sulfated
and phosphorylated peptides (Fig. 4). Binding of PA12 to
CCR5 was not inhibited by any of the peptides (Fig. 3).
PA10 binding to CCR5 was inhibited only by S-3/10/14 (Fig.
3). PA10 was also observed to associate with bS-10/14 and
to a lesser extent with bP-10/14 in BIAcore analysis (Fig.
4), which may be more sensitive than the gp120/CCR5-binding
assay. Binding of 2D7 to CCR5 was not inhibited by any of
the peptides (Fig. 3). No significant interaction was
observed between any CCR5 Nt peptide and MAb 2D7 (Figs. 3
and 4), whose epitope resides within the second
extracellular loop on CCR5.

Single cycle HIV-1 entry assay

NLluc+env- particles pseudotyped with envelope glycoproteins
from MuLV, HTLV-1 and HIV-1 strains JR-FL, HXB2, DH123 and
Gun-1 were made as described (Dragic et al. J. Virol. 72:279,
1998). Target cells (HeLa-CD4+CCR5+ or U87-CD4+CCR5+) were
incubated with virus-containing supernatant fractions
(100 ng/ml p24) ± peptide (100 µM) for 4 h, then washed and
resuspended in culture media. After 48 hours the cells were
lysed and luciferase activity (relative light units,
r.l.u.) was measured using a standard kit (Promega,
Madison, WI) as described (Dragic et al. J. Virol. 72:279,
1998). Viral entry was calculated: (r.l.u. with
peptide)/(r.l.u. without peptide) x 100%.

The ability of different CCR5 Nt peptides to inhibit HIV-1
entry into CD4+CCR5+CXCR4+ cells was tested using a
luciferase-based single round of entry assay (5). Only
peptides S-10/14 and S-3/10/14 inhibited the entry of the
R5 isolate HIV-1JR-FL by approximately 50% in HeLa-CD4+CCR5+
and U87MG-CD4+CCR5+ cells (Fig. 5 and data not shown). We were
unable to inhibit the entry of the R5X4 isolates HIV-1DH123
and HIV-1Gun-1, or of the X4 isolate HIV-1HXB2. The entry of
MuLV and HTLV pseudotypes was also unaffected by the
peptides (Fig. 5).

Screening assays

1) HIV-1 gp120/CD4-IgG2

Streptavidin-coated 96-well microtiter plates (NEN Life
Science Products, Boston, MA) are blocked with 200 µl/well
of 5% bovine serum albumin (Sigma, St. Louis, MO) in PBS
buffer and washed with assay buffer (0.5% Tween 20, 1%
fetal bovine serum, and 2% BSA in PBS buffer). The plates
are then incubated 1 hour at ambient temperature with 100
µl/well of biotinylated CCR5 N-terminal sulfopeptide at a
concentration of 500 µM in assay buffer. Following a wash
step, the plates are incubated for 1 hour at ambient
temperature with an HIV-1JR-FL gp120/CD4-IgG2 complex in the
presence or absence of inhibitory agent. The plates are
again washed and incubated for 30 minutes with a
horseradish peroxidase-labeled goat antibody to human IgG
(Kirkegaard & Perry Laboratories, Gaithersburg, MD)
followed by addition of the TMB
(3,3',5,5'-tetramethylbenzidine) chromogenic substrate (Pierce). The
reaction is stopped by addition of 100 µl/well of 2N H2SO4
prior to colorimetric detection at a wavelength of 450 nm.
Wells without biotinylated peptide serve as negative
controls. The percent inhibition of binding is calculated as

[1 - (OD with inhibitor - OD control well)/(OD without inhibitor - OD control well)] x 100,

where OD represents the average optical density
observed for the indicated wells.
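
As an illustration of the percent inhibition calculation, the following Python sketch averages replicate wells and applies the background correction from the negative control wells. The function name percent_inhibition and the optical density readings are hypothetical and are shown only to make the arithmetic concrete.

```python
from statistics import mean


def percent_inhibition(od_with_inhibitor, od_without_inhibitor, od_control):
    """Percent inhibition of binding from replicate optical densities.

    Each argument is a sequence of OD450 readings from replicate wells.
    The control wells (no biotinylated peptide) supply the background,
    which is subtracted from both conditions before taking the ratio.
    """
    background = mean(od_control)
    inhibited = mean(od_with_inhibitor) - background
    uninhibited = mean(od_without_inhibitor) - background
    if uninhibited <= 0:
        raise ValueError("uninhibited signal must exceed the control background")
    return (1.0 - inhibited / uninhibited) * 100.0


# Hypothetical OD450 readings, for illustration only
result = percent_inhibition(
    od_with_inhibitor=[0.50, 0.60],
    od_without_inhibitor=[1.20, 1.30],
    od_control=[0.10, 0.10],
)
print(f"inhibition of binding: {result:.1f}%")
```

Subtracting the control wells first keeps nonspecific signal from understating the effect of the inhibitory agent.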

2) Anti-CCR5 antibodies

Streptavidin-coated microtiter plates are blocked and
incubated with CCR5 N-terminal peptide as described above.
Following a wash step, the plates are incubated for one
hour at ambient temperature with the anti-CCR5 antibody
PA10 in the presence or absence of inhibitory agent. The
plates are again washed and incubated for 30 minutes with a
horseradish peroxidase-labeled goat antibody to mouse IgG
(Kirkegaard & Perry Laboratories, Gaithersburg, MD)
followed by addition of TMB substrate for colorimetric
detection as described above. The percent inhibition
mediated by the inhibitory agent is calculated as described
above.

Discussion

Tyrosine-modified peptides spanning the region of the CCR5
Nt that contains residues important for viral entry were
synthesized (Dragic et al. J. Virol. 72:279, 1998; Rabut
et al. J. Virol. 72:3464, 1998; Farzan et al. J. Virol.
72:1160, 1998; Doranz et al. J. Virol. 71:6305, 1997).
Interactions between the Nt peptides and gp120/CD4
complexes were characterized. Peptides containing
sulfo-tyrosines in positions 10 and 14 efficiently inhibited
binding of gp120JR-FL/CD4 to CCR5. Substitution of
phosphates, which are also negatively charged at
physiological pH, for the sulfate groups rendered the Nt
peptides inactive. Inhibition of gp120/CCR5 binding was dependent,
therefore, on the presence of sulfate moieties and was not
simply due to non-specific electrostatic interactions
between the peptide and the gp120/CD4 complex or the
peptide and the cell surface. Inhibition of gp120/CCR5
binding was also dependent on the primary structure
surrounding the sulfo-tyrosines since peptides with random
sequences of CCR5 amino acids 2-18 had no inhibitory
activity. Additional Nt amino acids in the region 2-18 were
important for activity since a shortened peptide containing
just amino acids 10-14 was unable to inhibit gp120/CD4
binding, despite the presence of two sulfo-tyrosines. It
would be straightforward to define the minimum number of
amino acids needed for activity by systematically
synthesizing sulfopeptides intermediate in length between
peptide 2-18 and peptide 10-14. Similarly, sulfopeptides
that incorporate a greater portion of the CCR5 Nt could be
easily synthesized and tested for activity using the
methods described herein.

Qualitative BIAcore analyses allowed the demonstration of a
highly specific, CD4-dependent interaction between a
tyrosine-sulfated Nt peptide and gp120JR-FL. No binding of
the protein complex to a tyrosine-phosphorylated peptide
was observed. Only gp120s derived from isolates that use
CCR5 as a co-receptor associated with the sulfated peptide.
gp120DH123/CD4 binding was weaker than gp120JR-FL/CD4 binding,
suggesting that envelope glycoproteins from R5X4 isolates
have a lower apparent affinity for CCR5 than envelope
glycoproteins from R5 isolates. gp120LAI, derived from an
isolate that only uses CXCR4, did not bind to the sulfated
peptide. A V3 loop-deleted gp120JR-FL did not associate with
the sulfated peptide, just as this protein was unable to
bind to full length CCR5 on the cell surface (Trkola et al.
Nature 384:184, 1996).

The binding of the Nt peptides to several anti-CCR5 MAbs,
all of which recognize conformational epitopes in CCR5 and
inhibit gp120/CCR5 binding, was also studied. PA12 and 2D7
did not bind to any of the peptides. Binding of PA8 to the
peptides was independent of tyrosine modification whereas
PA10 associated more with the sulfo-tyrosine-containing
peptide than with the phospho-tyrosine-containing peptide.
It seems, therefore, that sulfo-tyrosines and
phospho-tyrosines are relatively interchangeable for the purpose of
MAb binding but that gp120/CD4 binding has an absolute
requirement for sulfo-tyrosines. Relatively subtle
differences in size and geometry of sulfate and phosphate
groups might be relevant for binding of the CCR5 Nt with
gp120, which must not only accept the negative charge, but
also coordinate, probably by hydrogen bonds, the tyrosine
sulfate oxygens. The kinetics of MAb binding to the CCR5 Nt
peptides exhibited large apparent on rates and slow
apparent off rates, which also differed from our
observations of gp120/CD4 binding kinetics.

None of the Nt peptides inhibited MuLV, HTLV and HIV-1HxB2
envelope-mediated viral entry, which is not mediated by
CCR5. In contrast, peptides S-10/14 and S-3/10/14
specifically inhibited the entry of the HIV-1JR-FL R5 strain
in two different cell lines. The inhibition of HIV-1 entry
by tyrosine-sulfated peptides was partial (~50%) but
nonetheless striking given the difficulty of blocking this
process with short, linear peptides (Jameson et al. Science
240:1335, 1988; Chan and Kim Cell 93:681, 1998; Doranz et
al. J. Exp. Med. 186:1395, 1997; Heveker et al. Current
Biology 8:369, 1998; Eckert et al. Cell 99:1, 1999).

Second Series of Experiments

CD4 and CCR5 mediate fusion and entry of R5 HIV-1 strains.
Sulfotyrosine and other negatively charged residues in the
CCR5 amino-terminal domain (Nt) are crucial for gp120
binding and viral entry. It is shown that a soluble
gp120/CD4 complex specifically binds to a peptide
corresponding to CCR5 Nt residues 2-18, with sulfotyrosines
in positions 10 and 14. This sulfopeptide also inhibits
soluble gp120/CD4 binding to cell surface CCR5 as well as
infection by R5 virus. These observations prompted the
further delineation of the determinants of the gp120-CCR5
Nt sulfopeptide interaction. It is shown that residues
10-18 constitute the minimal domain of the CCR5 Nt that is
able to specifically interact with soluble gp120/CD4
complexes. In addition to sulfotyrosines in positions 10
and 14, negatively charged residues in positions 11 and 18
participate in this interaction. Furthermore, the CCR5 Nt
binds to a CD4-induced surface on gp120 that is composed of
conserved residues in the V3 loop stem and the C4 domain.
Binding of gp120 to cell surface CCR5, however, is further
influenced by variable residues in the crown of the V3
loop. These data suggest that gp120 docking to CCR5 is an
interdependent, multi-step process involving different
regions of the envelope glycoprotein and the co-receptor.

Entry of HIV-1 R5 isolates into target cells is mediated by
the successive interaction of the envelope glycoprotein
gp120 with CD4 and the CCR5 co-receptor [3]. Gp120-CD4
complex formation generates a large bonding energy that
drives reordering of the gp120 core structure [22, 31, 47].
Changes in the orientation of the V1/V2 and V3 loops, as
well as the bridging sheet (composed of the V1/V2 stem and
C4), cooperatively create and/or expose a co-receptor
binding site on gp120 [22, 37, 47]. The predicted co-receptor
binding surface on gp120 has a hydrophobic core surrounded
by a positively charged periphery and is composed of both
conserved and variable residues located in the C4 domain
and V3 loop, with lesser contributions from the V1/V2 stem
[22, 36, 37].



It has been demonstrated that specific amino acids within
the CCR5 amino-terminal domain (Nt, amino acids 2-31),
including negatively charged and tyrosine residues, are
essential for CCR5-mediated fusion and entry of R5 and R5X4
HIV-1 strains [5, 12, 13, 15, 35]. Farzan et al. [16]
demonstrated that the CCR5 Nt undergoes both
O-glycosylation and tyrosine sulfation. It is presently not
known whether O-glycosylation plays a role in co-receptor
function, but this possibility is suggested by observations
that serines in the Nt are important for viral entry.
Inhibition of cellular sulfation pathways, including
tyrosine sulfation, greatly decreases gp120 binding to CCR5
as well as the entry of R5 and R5X4 HIV-1 strains into
target cells ([16], E.G.C. unpublished data).
Post-translational sulfation of the tyrosine residues in the
CCR5 Nt, therefore, may critically modulate the
susceptibility of target cells to HIV-1 infection in vivo.



It was demonstrated that a CCR5 Nt-based peptide spanning
residues 2-18 and containing sulfotyrosines in positions 10
and 14 specifically associates with soluble gp120/CD4
complexes containing envelope glycoproteins from R5 (JR-FL)
and R5X4 (DH123) but not X4 (LAI) strains [11]. Peptides
containing unmodified tyrosines or phosphotyrosines,
however, did not bind soluble gp120/CD4 complexes [11]. The
tyrosine-sulfated CCR5 Nt therefore specifically interacts
only with gp120 proteins from isolates that use this
co-receptor to gain entry into target cells. Furthermore,
only the CCR5 Nt-based sulfopeptide inhibits binding of
soluble gp120JR-FL/CD4 to intact, cell surface-expressed
CCR5 and moderately blocks the entry of the R5 isolate
JR-FL. The affinity of soluble gp120/CD4 for the CCR5 Nt
sulfopeptide, however, is approximately 10-100-fold lower
than for the native, membrane-associated co-receptor
[11, 42, 46], suggesting that other gp120-CCR5 contacts are
required to consolidate this interaction. This concept is
further supported by studies of CCR5 chimeras, as well as
studies with inhibitors of CCR5 co-receptor function
[12, 34, 38, 32, 14].

A novel ELISA is reported to detect binding of
sulfopeptides to soluble gp120/CD4 complexes, as well as
anti-CCR5 MAbs and chemokines. ELISA and surface plasmon
resonance (SPR) were used to further delineate the
determinants of the gp120-CCR5 Nt interaction. In order to
define the minimal domain of the CCR5 Nt capable of
specifically binding to soluble gp120/CD4 complexes,
sulfopeptides corresponding to different regions of the Nt
were analyzed. To identify the gp120 domains involved in
sulfopeptide binding, inhibition of gp120/CD4 complex
binding to CCR5 Nt sulfopeptides by anti-gp120 MAbs was
studied. Residues in or near the epitopes of inhibitory
MAbs were mutated to alanine, and the gp120 point mutants
were compared for their ability to bind to CCR5 Nt
sulfopeptides and cell-surface CCR5. The data suggest that
a mostly conserved surface of gp120 binds to a nine-residue
stretch of the CCR5 Nt, whereas more variable residues in
the crown of the V3 loop may interact with a secondary
binding site on CCR5.

Materials and Methods 



Reagents: CD4-IgG2, soluble CD4 (sCD4), recombinant soluble
gp120s from HIV-1LAI (X4), HIV-1DH123 (R5X4), and HIV-1JR-FL
(R5) isolates, anti-gp120 MAb PA1 (directed against the V3
loop of JR-FL) and anti-CCR5 MAbs PA8, PA10, PA11, PA12 and
PA14 were produced by Progenics Pharmaceuticals, Inc.
(Tarrytown, NY) as described [1, 32]. MAbs 133-290,
133-192, 135-9, A32, 17b, 19b, 48d, 9284, G3-42, C11, G45-60
and 2G12 were a generous gift [26]. The small-molecule CCR5
antagonist TAK-779 was obtained as described [14].

Peptides corresponding to different segments of the CCR5 Nt
were synthesized as described previously (Table 1) [11].
Sulfo- or phospho-tyrosines were incorporated in positions
10 and 14, and all peptides carried a carboxy-terminal
Gly-Ala-Gly spacer preceding a biotinylated lysine. Residues
were numbered according to their positions in the
full-length CCR5 protein.
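
The peptide design rules just described (full-length residue
numbering, modified tyrosines at positions 10 and 14, a
Gly-Ala-Gly spacer and a biotinylated lysine) can be
sketched in a few lines of Python. This is an illustrative
sketch only. The 31-residue Nt sequence is the commonly
reported human CCR5 amino terminus and is an assumption here,
since this section gives only residues 10-18; the function
names and the text rendering of the peptide are hypothetical.

```python
# Assumption: commonly reported human CCR5 amino terminus (residues 1-31).
# Only residues 10-18 (YDINYYTSE) are spelled out in the document itself.
CCR5_NT = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"

SPACER = "GAG"            # carboxy-terminal Gly-Ala-Gly spacer
MODIFIED_TYR = (10, 14)   # positions carrying sulfo- or phospho-tyrosine


def nt_segment(first, last):
    """Return residues first..last (1-based, inclusive, full-length numbering)."""
    if not 1 <= first <= last <= len(CCR5_NT):
        raise ValueError("residue range outside the Nt sequence")
    return CCR5_NT[first - 1:last]


def describe_peptide(first, last, modification="sulfo"):
    """Hypothetical text rendering of one biotinylated Nt peptide."""
    parts = []
    for pos in range(first, last + 1):
        residue = CCR5_NT[pos - 1]
        if pos in MODIFIED_TYR:
            # Positions 10 and 14 must be tyrosines to carry the modification.
            if residue != "Y":
                raise ValueError(f"position {pos} is not a tyrosine")
            parts.append(f"Y({modification})")
        else:
            parts.append(residue)
    # Append the spacer residues and the biotinylated lysine.
    return "-".join(parts + list(SPACER) + ["K(biotin)"])


if __name__ == "__main__":
    print(nt_segment(10, 18))
    print(describe_peptide(10, 18))
    print(describe_peptide(2, 18, modification="phospho"))
```

Numbering by position in the full-length protein means that
peptide names such as 2-18 or 10-18 map directly onto slices
of the same reference sequence, whatever the peptide length.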

Surface plasmon resonance: Binding of gp120/CD4-IgG2 complex
and MAbs to CCR5 Nt-based peptides was measured as
previously described [11]. Briefly, streptavidin-coated
sensor chips were divided into two surfaces, each with a
separate flow chamber. The sensor chip was conditioned and
equilibrated as recommended by the manufacturer.
Biotinylated peptide (400 resonance units, RU) was bound to
the surface of the second chamber whereas the first chamber
of the chip was used as a negative control. Gp120/CD4-IgG2
complex (50 nM) was passed over the chip surface in the
presence or absence of MAbs (150 nM). Surface plasmon
resonance was monitored and displayed in RU as a function
of time using a Biacore X. After each measurement the chip
was regenerated and equilibrated as recommended by the
manufacturer.

Generation of gp120 alanine mutants and their binding to
CD4-IgG2: Mutant gp120 proteins were generated using the
QuikChange Kit from Stratagene (San Diego, CA). Gp120JR-FL,
cloned into the pPPI4 expression vector [4], served as the
template for site-directed mutagenesis. Nucleotide
sequencing was performed to ascertain the presence of the
appropriate mutation in the gp120 coding sequence. 293T
cells were calcium phosphate transfected with the different
mutant gp120 expression constructs. Supernatants containing
soluble gp120 proteins were harvested and cleared of debris
by centrifugation 24 hours post-transfection.

Quantification of gp120 was performed by ELISA as
previously described [40]. Briefly, 293T supernatants were
boiled for 5 minutes and denatured gp120 was captured on an
ELISA plate coated with D7324 (International Enzymes Inc.,
Fallbrook, CA), a MAb that recognizes a 15-residue linear
epitope in the carboxy-terminal end of gp120. Captured
gp120 was detected by a mixture of anti-gp120 MAbs B12 and
B13 [40], followed by incubation with a horseradish
peroxidase (HRP)-conjugated anti-mouse IgG antibody
(Amersham Pharmacia, Piscataway, NJ). Optical density
(O.D.) was measured at 450 nm using the ImmunoPure TMB
Substrate kit (Pierce, Rockford, IL).

CD4-IgG2 binding to non-denatured mutant gp120 proteins also
was measured. Plates coated with D7324 were used to capture
native gp120 from supernatants of transiently transfected
293T cells. CD4-IgG2 (50 nM) was added to the plates and its
binding was detected using an HRP-conjugated goat
anti-human IgG and TMB substrate as described above.

CCR5 Nt peptide ELISAs: Streptavidin-coated ELISA plates
(Pierce, Rockford, IL) were blocked with D-PBS/5% BSA for
2 hours at room temperature then washed three times with
assay buffer (D-PBS/0.5% Tween 20/1% Fetal Bovine Serum/
2% BSA). Plates were then contacted with sulfo- or
phospho-peptides (1 µg/ml) for 1 hour at room temperature
and washed three times with assay buffer. Mixtures of
CD4-IgG2 (50 nM) and purified gp120 or gp120-containing
supernatants in a 1:4 molar ratio were added to the plates
for 1 hour at room temperature. Plates were washed three
times and HRP-conjugated goat anti-human IgG was used to
detect the presence of bound CD4-IgG2. The plates were
developed using the TMB substrate as described above.
Gp120/CD4-IgG2 binding to the peptides was normalized for
CD4-IgG2 binding to the mutant gp120 proteins.

In a competition ELISA, peptides were captured onto the
plates as described above. Inhibitor or assay buffer was
added for 1 h prior to addition of gp120JR-FL/CD4-IgG2
complex (1 nM) for an additional hour at room temperature.
The assay was then completed as described above. Direct
binding of anti-CCR5 murine MAbs to the peptides was
examined as described above except that MAb was substituted
for gp120/CD4-IgG2 complex and a goat anti-mouse HRP-coupled
antibody was used for detection.
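
The text does not state how inhibition in the competition
ELISA was quantified. One common convention, shown here
purely as an assumption, compares the optical density
measured with inhibitor against the optical density measured
with assay buffer alone, after subtracting a background
reading. The function name and the example readings are
hypothetical.

```python
def percent_inhibition(od_with_inhibitor, od_buffer_only, od_background=0.0):
    """Percent inhibition of complex binding in a competition ELISA.

    Assumed convention (not stated in the document):
        100 * (1 - (OD_inhibitor - OD_bg) / (OD_buffer - OD_bg))
    Negative values indicate enhanced rather than inhibited binding.
    """
    signal = od_with_inhibitor - od_background
    reference = od_buffer_only - od_background
    if reference <= 0:
        raise ValueError("buffer-only signal must exceed background")
    return (1.0 - signal / reference) * 100.0


if __name__ == "__main__":
    # Hypothetical O.D. 450 nm readings, for illustration only.
    print(round(percent_inhibition(0.15, 1.05, od_background=0.05), 1))
```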

Binding of gp120/CD4-IgG2 complexes to cell-surface CCR5:
L1.2-CCR5+ cells (10^) were incubated with gp120-containing
supernatant (100 nM) and biotinylated CD4-IgG2 (50 nM) for 1
hour at 37°C in assay buffer, as previously described [32].
Gp120/CD4-IgG2 bound to the cells was revealed by FACS
analysis of the mean fluorescence intensity (m.f.i.)
after addition of streptavidin-PE (Pharmingen, San Diego,
CA). Binding was calculated using the formula: (m.f.i.
gp120 mutants) / (m.f.i. gp120 wild type) x 100% and
normalized for CD4-IgG2 binding to the mutant gp120
proteins.
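
The binding formula above can be written out as a short
Python sketch. The first function is the stated formula. The
second applies the normalization for CD4-IgG2 binding, and it
rests on an assumption: the text does not say how the
normalization was performed, so the sketch divides by the
mutant's CD4-IgG2 binding expressed as a fraction of wild
type. Function and parameter names are hypothetical.

```python
def percent_of_wild_type(signal_mutant, signal_wild_type):
    """(signal mutant) / (signal wild type) x 100%, as in the stated formula."""
    if signal_wild_type <= 0:
        raise ValueError("wild-type signal must be positive")
    return signal_mutant / signal_wild_type * 100.0


def normalized_ccr5_binding(mfi_mutant, mfi_wild_type,
                            cd4_mutant, cd4_wild_type):
    """CCR5 binding (% of wild type) corrected for CD4-IgG2 binding.

    Assumption: normalization divides the CCR5 signal by the mutant's
    CD4-IgG2 binding relative to wild type.
    """
    ccr5_pct = percent_of_wild_type(mfi_mutant, mfi_wild_type)
    cd4_fraction = percent_of_wild_type(cd4_mutant, cd4_wild_type) / 100.0
    if cd4_fraction <= 0:
        raise ValueError("mutant shows no measurable CD4-IgG2 binding")
    return ccr5_pct / cd4_fraction


if __name__ == "__main__":
    # Hypothetical readings: the mutant gives 25% of the wild-type m.f.i.
    # but also binds CD4-IgG2 only half as well as wild type.
    print(normalized_ccr5_binding(50.0, 200.0, 0.5, 1.0))
```

Under this assumed convention, a mutant that binds CD4-IgG2
poorly is not scored as defective for CCR5 binding merely
because less complex is formed.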

Results 

An ELISA to detect binding of soluble gp120/CD4 complexes
to CCR5 Nt-based peptides: Surface plasmon resonance (SPR)
was previously used to show that gp120/sCD4 complexes
specifically interact with a peptide spanning CCR5 residues
2-18 and containing sulfotyrosines in positions 10 and 14
(2-18, Table 1). The on and off rates of complex-peptide
binding were extremely rapid and could not be measured
precisely by SPR. The Kd was estimated to be in the 10"'^-10"^
range. Replacing monomeric sCD4 with tetravalent CD4-IgG2,
however, led to a dramatic shift in both on and off rates,
lowering the Kd into the IQ-^-lO"^ range (Figure 7a). This
observation prompted us to develop an ELISA to directly
detect complex-peptide binding. Streptavidin-coated ELISA
plates were used to capture biotinylated, CCR5 Nt-based
peptides, and then further incubated with soluble
gp120/CD4-IgG2 complexes. Complex binding was detected using
an HRP-conjugated goat anti-human IgG antibody.
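
The relationship between the on rate, the off rate and the
Kd referred to above is the standard one for a simple 1:1
binding model, Kd = k_off / k_on. The sketch below
illustrates it in Python. The rate constants used in the
example are hypothetical placeholders chosen only to show the
arithmetic; they are not values from this work, and a
multivalent complex such as gp120/CD4-IgG2 yields an apparent
(avidity-influenced) Kd rather than a true 1:1 constant.

```python
def dissociation_constant(k_off, k_on):
    """Kd in M from k_off (1/s) and k_on (1/(M*s)), assuming 1:1 binding."""
    if k_on <= 0 or k_off < 0:
        raise ValueError("k_on must be positive and k_off non-negative")
    return k_off / k_on


def fold_improvement(kd_reference, kd_new):
    """How many fold lower kd_new is than kd_reference."""
    if kd_new <= 0:
        raise ValueError("kd_new must be positive")
    return kd_reference / kd_new


if __name__ == "__main__":
    # Hypothetical rate constants, for illustration only.
    kd_fast_off = dissociation_constant(k_off=1e-1, k_on=1e5)
    kd_slow_off = dissociation_constant(k_off=1e-3, k_on=1e5)
    print(kd_fast_off, kd_slow_off, fold_improvement(kd_fast_off, kd_slow_off))
```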

Sulfopeptide 2-18 bound gp120JR-FL/CD4-IgG2 with an IC50 of
~1 nM, and gp120DH123/CD4-IgG2 with an IC50 of ~5 nM (Figure
7b). Sulfopeptide 2-18 did not measurably bind CD4-IgG2
alone or in complex with either gp120LAI or V3 loop-deleted
gp120JR-FL (Figure 7b and data not shown). No binding was
observed to an analogous CCR5 Nt phosphopeptide (2-18 (P),
Figure 12) by any of the gp120/CD4-IgG2 complexes (Figure
7b). Identical patterns of reactivity were observed for
gp120s in complex with CD4-γ2, a divalent
CD4-immunoglobulin fusion protein [data not shown, (1)].
However, no binding was observed for gp120 in complex with
anti-gp120 MAbs 2G12 and IgG1b12, even though the latter's
epitope overlaps the CD4 binding site on gp120 (data not
shown). Thus the ELISA reproduces the critical biological
features of cell-surface CCR5/gp120 interactions, including
a dependence upon CCR5 tyrosine sulfation, CD4, the V3 loop,
and the co-receptor usage patterns of the parent viruses.

Using a competition ELISA, inhibition of gp120JR-FL/CD4-IgG2
binding to the sulfopeptide 2-18 was achieved with the
anti-CCR5 MAb PA8. However, binding of soluble
gp120/CD4-IgG2 complexes to the sulfopeptide was not
inhibited by TAK-779, nor by the CC-chemokines MIP-1α,
MIP-1β and RANTES, even when used at supraphysiologic
concentrations.

Binding of CCR5 Nt peptides to anti-CCR5 MAbs and soluble
gp120/CD4: ELISA was used to test the binding of a panel of
anti-CCR5 MAbs to peptides 2-18 and 2-18 (P). We had
previously demonstrated that MAbs PA8, PA11 and PA12 bind
epitopes in the Nt, PA10 binds an epitope that spans the Nt
and ECL2, and PA14 binds an epitope exclusively in ECL2
[32]. Here we show that PA8 avidly binds peptides 2-18 and
2-18 (P) (Figure 2). PA10 binds avidly to 2-18 and
moderately to 2-18 (P); PA11 binds moderately to 2-18 and
weakly to 2-18 (P). PA12's binding is weak for 2-18 and
undetectable for 2-18 (P). Finally, PA14 does not recognize
either the sulfopeptide or the phosphopeptide (Figure 8).
Furthermore, PA8 binds similarly to all of the CCR5
Nt-based sulfopeptides in Figure 12 (data not shown).

In order to more precisely delineate the minimal CCR5 Nt
domain that specifically binds to soluble gp120/CD4
complexes, a panel of sulfopeptides spanning different
stretches of the CCR5 Nt was synthesized (Figure 12). All
of the peptides carried sulfotyrosines in positions 10 and
14 since we previously showed that these are required for
complex-peptide binding. Binding of gp120JR-FL/CD4-IgG2 to
the different sulfopeptides was tested by ELISA. Although
the strongest binding was observed using the longest
sulfopeptide, 2-18, significant binding was also observed
for peptide 10-18, which demonstrated ~3-fold lower avidity
(Figure 9). Peptides 8-15, 6-16 and 10-15 bound the soluble
complex at least ten-fold less avidly than 2-18. (It was
previously shown that a sulfopeptide consisting of residues
10-14 did not bind soluble gp120/CD4 complexes.)
Furthermore, the gp120JR-FL/CD4-IgG2 complex only weakly
bound to peptide 10-18 carrying two alanine mutations in
positions 11 and 18. Previous mutagenesis studies have
shown that residues Asp-11 and Glu-18 are important for
fusion, entry and gp120-CCR5 binding [12, 13]. Finally, it
should be noted that the same binding patterns to the
different sulfopeptides were observed with soluble
complexes containing gp120DH123 (data not shown).

Inhibition of gp120/CD4 binding to CCR5 Nt sulfopeptides by
anti-gp120 MAbs: In order to determine which domains of
gp120 were involved in binding to CCR5 Nt-based
sulfopeptides, the ability of a panel of well-characterized
anti-gp120 MAbs [25] to inhibit binding of either the
gp120JR-FL/CD4-IgG2 or the gp120DH123/CD4-IgG2 complex to the
2-18 sulfopeptide was tested. Only MAbs directed against
CD4-induced (CD4i) epitopes and the V3 loop were capable of
inhibiting binding of the gp120/CD4-IgG2 complex to the
CCR5 sulfopeptide. Inhibition of gp120JR-FL and gp120DH123
binding by MAbs 17b and 48d was >90%. The anti-V3 loop MAb
19b [28], which recognizes an epitope in the crown of the
V3 loop (sequence -I G--FY-T) and is reactive with R5
strains, inhibited gp120JR-FL binding by >90% and gp120DH123
binding by approximately 80%. Anti-V3 loop MAb PA1, which
was raised against gp120JR-FL (W.C.O., unpublished results),
efficiently inhibited binding of gp120JR-FL but not
gp120DH123. Finally, anti-V3 loop MAb 9284 [21], which
recognizes an epitope spanning residues 307 to 330 in the V3
loop of X4 strains, was unable to inhibit binding of either
gp120 protein to the sulfopeptide. MAbs directed against
other epitopes in other constant and variable regions of
gp120 also had no effect on binding of the soluble complex
to the peptides. Similar results were obtained when the
anti-gp120 MAbs were used to inhibit soluble complex binding
to cell surface CCR5 (data not shown).



Binding of mutant soluble gp120/CD4 complexes to CCR5 Nt
sulfopeptides: Numerous studies have shown that residues in
the V3 loop determine co-receptor usage and binding [6-10,
18, 20-21, 23-24, 27, 29, 33, 41-44, 46]. The crystal
structure of a gp120 lacking the V1/V2 and V3 loops in
complex with sCD4 and the 17b MAb further implicated a
conserved, CD4i surface on gp120, adjacent to the V3 loop,
in co-receptor binding [36, 37]. Single alanine mutants of
all of the residues near or within regions previously shown
to be important for co-receptor usage were generated. These
gp120 mutants were tested for their ability to bind to the
CCR5-based sulfopeptide 2-18 as well as to cell surface
CCR5. Binding was normalized for gp120 mutant binding to
CD4-IgG2. Wild-type levels of binding were observed for all
mutants except W427A, R440A and R469A, which bound CD4-IgG2
with 5-10-fold lower but nonetheless measurable avidity.



Residues in both strands of the V3 loop stem were found to
be involved in gp120 binding to the 2-18 sulfopeptide:
Alanine mutants of residues R298, N301, T303, I322, D324,
I325 and R326 were found to decrease complex binding to the
peptide by >10-fold. Residues in the crown of the V3 loop,
including the GPGR motif, however, had no effect on gp120
binding to the sulfopeptide. C4 residues in or adjacent to
the two C-terminal β-strands of the bridging sheet were
also shown to participate in binding to the sulfopeptide:
Alanine substitutions of R419, I420, K421, Q422 and R444
decreased complex binding to the sulfopeptide by 5-10-fold.
None of the alanine substitutions that we introduced in the
other regions of gp120 significantly affected
complex-peptide interactions.

It was furthermore demonstrated that additional gp120
residues are involved in complex binding to cell
surface-expressed CCR5. Alanine substitutions of residues
S306, G310, P311, R313, F315 and Y316 in the crown of the V3
loop decreased complex binding to CCR5 by 5-10-fold.
Furthermore, alanine substitutions of several residues in
C1, C2 and C3 also had a moderate effect on complex binding
to CCR5. Finally, it is noted that alanine substitutions of
R440 and R469 increased complex binding to both 2-18 and
CCR5, whereas substitutions of E320 and W427 increased
complex binding to CCR5 only.

Discussion

Tyrosine-sulfated CCR5 Nt peptides were studied for binding
to soluble gp120/CD4 complexes as well as anti-CCR5 MAbs,
CC-chemokines and TAK-779 using a novel solid-phase ELISA.
Inhibition of peptide-complex interactions by anti-gp120
MAbs was explored by surface plasmon resonance. These MAbs
were also tested for their ability to inhibit complex
binding to cell surface CCR5. In addition, a panel of
gp120 point mutants was generated and their reactivity
with CCR5 Nt peptides and cell surface CCR5 was compared.
The principal conclusions are that (1) residues 10 to 18 of
the CCR5 Nt may define the minimum recognition site for
gp120, (2) gp120 binding to the CCR5 Nt depends on highly
conserved residues located in the C4 domain and the stem of
the V3 loop, and (3) gp120 binding to cell surface CCR5
depends on a broader region that includes residues in the
crown of the V3 loop, C1, C2 and C3. The findings suggest
that distinct domains of gp120 and CCR5 bind in a
multi-step fashion and raise questions about the
determinants of specificity of the co-receptor-gp120
interaction.

An ELISA was developed to detect complex-peptide binding
based on the observation that the tetravalent
gp120/CD4-IgG2 complex binds to CCR5 Nt sulfopeptides ten-
to a hundred-fold more avidly than the monovalent
gp120/sCD4 complex. Complex-sulfopeptide binding was only
observed with gp120 proteins derived from R5 and R5X4, but
not X4, HIV-1 strains. V3 loop-deleted gp120JR-FL failed to
bind to the sulfopeptides. Phosphopeptides did not bind to
any of the soluble complexes. Thus, the ELISA reproduces
the critical biological features of cell-surface CCR5-gp120
interactions, including a dependence upon CD4, CCR5
tyrosine sulfation, the V3 loop and the co-receptor usage
patterns of the parental viruses.

CCR5 Nt phosphopeptides and sulfopeptides were
differentially recognized by anti-CCR5 MAbs in ELISA. PA8
possessed equal avidity for sulfated and phosphorylated
peptides, implying that its epitope does not include
tyrosine side chains. PA10 and PA11 preferentially
recognized the sulfopeptide, albeit with varying
efficiencies, suggesting that sulfotyrosines participate
either directly in peptide-MAb interactions or indirectly
by influencing epitope conformations. PA12 only interacted
with the sulfopeptide and PA14 did not bind either Nt
peptide. It was previously shown that both of these MAbs
recognize discontinuous epitopes comprising residues in the
Nt and ECL2 of CCR5. The observations now imply that ECL2
residues are marginal for PA12 binding and essential for
PA14 binding to CCR5. Finally, binding of soluble
gp120/CD4 complexes to CCR5 Nt peptides could be competed
with an anti-CCR5 MAb but not with either CC-chemokines or
TAK-779, whose binding sites have been mapped to other
regions of CCR5 (14, 39). Both CC-chemokines and TAK-779,
however, are able to compete with gp120/CD4 binding to cell
surface CCR5, perhaps through steric or conformational
mechanisms (14, 42, 46). It is noted that Farzan et al.
reported that a CCR5 Nt sulfopeptide spanning residues 1-22
partially blocks MIP-1α binding to cell-surface CCR5 and we
attribute the discrepancy to differences in peptides and
assays (17).

In order to more precisely delineate the gp120 binding site
in the CCR5 Nt, an ELISA was used to test binding of
soluble complexes to several CCR5 Nt sulfopeptides. 10-18
was the smallest sulfopeptide that avidly bound soluble
gp120/CD4 complexes and may define the minimum docking site
for gp120 on CCR5. In addition to the two sulfotyrosines
in positions 10 and 14, negatively charged amino acids D11
and E18 were found to be critical for complex-peptide
binding. It was concluded that a cluster of negative
charges in the CCR5 Nt appears to represent the principal
recognition motif for gp120, although residues 2 to 9
further contribute to binding. Similar patterns of peptide
reactivity were observed for recombinant gp120s derived
from HIV-1JR-FL (R5) and HIV-1DH123 (R5X4), suggesting that
the CCR5 Nt sulfopeptides recognize conserved structures in
the envelope glycoprotein. Gp120DH123, however, bound about
five-fold less than gp120JR-FL to the sulfopeptides, which
probably accounts for its less efficient usage of CCR5
(13).

Anti-gp120 MAbs were tested for their ability to inhibit
gp120/CD4 binding to sulfopeptides or to cell surface CCR5.
A number of anti-gp120 MAbs directed against conserved and
variable regions of the envelope glycoprotein were not
inhibitory. Only MAbs 48d and 17b, directed against CD4i
epitopes, and 19b and PA1, directed against the V3 loop,
efficiently inhibited gp120 binding to the 2-18
sulfopeptide and to cell surface CCR5. The CD4i epitope
was previously shown to participate in co-receptor binding
and residues in the V3 loop primarily determine co-receptor
specificity (36, 37). The results now suggest that these
regions of gp120 determine its association with the CCR5
Nt. It is noted that inhibition of peptide-complex binding
by 19b, which recognizes an epitope in the V3 crown, is
inconsistent with the finding from gp120 mutagenesis
experiments that residues in the V3 loop crown do not
participate in complex-peptide binding. This leads to the
conclusion that the inhibitory effect of 19b may be due to
steric hindrance.

In order to determine more precisely the regions of gp120
that modulate the gp120-CCR5 interaction, the binding of a
panel of gp120 point mutants to the CCR5 Nt sulfopeptide
and to cell surface CCR5 was tested. The mutants were
created by the introduction of single alanine substitutions
near or within regions previously shown to be important for
the integrity of the CD4i epitope and/or CCR5 binding (36,
37). Highly conserved residues in C4 and the V3 loop stem,
including four arginines and a lysine, were found to affect
binding of gp120 to the CCR5 Nt sulfopeptide (Figure 13).
These residues are located in two random coil segments of
C4 that straddle the V3 loop stem and may constitute a
positively charged CCR5 Nt binding domain (22).
Additional, conserved residues in the crown of the V3 loop,
C1, C2 and C3 contribute to gp120 binding to cell surface
CCR5 (Figure 13). It remains to be determined whether
these residues interact with other extracellular domains of
CCR5 or whether they influence the conformation of C4 and
the V3 loop stem in a way that is only relevant in the
context of gp120/CD4 binding to cell surface CCR5. It is
unlikely that these residues also interact with the Nt in
the context of cell surface CCR5 because they are
relatively distal from the C4 and V3 residues that were
implicated in sulfopeptide binding (22).



To date, several lines of evidence suggest that gp120 binds
to more than one region of the CCR5 co-receptor: (1) the
affinity of gp120/CD4 for the CCR5 Nt sulfopeptide is
approximately 10-100-fold lower than for the native,
membrane-associated co-receptor (11, 42, 46), (2)
co-receptor chimera studies implicate the extracellular
loops in viral fusion and entry (2, 12, 34, 38) and (3)
inhibitors of CCR5 co-receptor function such as MAbs
and PA14, as well as TAK-779, do not bind to the CCR5 Nt yet
block gp120/CD4 binding to CCR5 (14, 32). The present
findings could be interpreted to support a distributed
model for gp120-CCR5 interactions that mirrors the two-site
paradigm proposed for the interaction of certain chemokines
with their receptors (30, 45). In this model, binding is
initially driven by electrostatic interactions between
negatively charged residues in the receptor Nt and basic
surfaces on the chemokine ligand. This binding serves to
orient the ligand and promote its interactions with other
portions of the chemokine receptor. The V3 loop crown may
form initial electrostatic interactions with the
extracellular loops of CCR5, which would allow the CCR5 Nt
to bind to a conserved region of gp120 comprising residues
in C4 and the V3 loop stem. Alternatively, the CCR5 Nt
could first bind the C4/V3 stem domain, which would then
promote an interaction of the V3 loop with some other
region of CCR5. All of these interactions involve
additional gp120 residues that we have yet to identify.
The role of the putative second interaction is unclear but
it may further stabilize the gp120-CCR5 interaction,
optimally orient the fusion apparatus, or trigger
gp41 conformational changes that are required for fusion.

The findings present us with a seeming paradox wherein nine
residues of the CCR5 Nt confer specificity on the
CCR5-gp120 interaction by binding to gp120 residues that are
highly conserved among clade B isolates, regardless of
their co-receptor usage. However, although the C4 and V3
stem residues themselves are conserved, their precise
placement may differ for R5 and X4 viruses. Clearly,
relatively minor differences in the orientation, exposure
or relative positioning of these widely separated residues
could abrogate binding to a short peptide but not a MAb
(e.g., 17b) possessing a larger, more distributed binding
site (37). In addition, more variable amino acids (e.g.,
324) within or outside the C4/V3 loop stem may contribute
to the specificity of the gp120-Nt interaction, and we
showed that residues N279, R313, P369 and R444 participate
in gp120/CD4 binding to cell surface CCR5 but not to CCR5
Nt sulfopeptides. Future studies employing additional
gp120 mutants together with CCR5 mutants and CXCR4-based
sulfopeptides will shed light on the specificity
determinants of the gp120-co-receptor interaction.

CD4 and CCR5 mediate fusion and entry of R5 HIV-1 strains.
Sulfotyrosines and negatively charged residues in the CCR5
Nt are crucial for binding of gp120 and viral entry.
Soluble gp120/CD4 complexes specifically bind to CCR5 Nt
peptides containing sulfotyrosines in positions 10 and 14.
CCR5 Nt sulfopeptides inhibit gp120/CD4 binding to CCR5 as
well as viral entry. Residues in the V3 loop and the C4
region of gp120 compose a binding site for the CCR5
amino-terminal domain. Residues 10-18 of the CCR5 Nt
constitute a minimal binding domain for gp120:
sulfotyrosines Y10 and Y14 and negatively charged residues
D11 and E18 are important for binding. The CCR5 N-terminal
binding site on gp120 is composed mostly of residues in the
V3 loop stem and the C4 domain.

What is claimed:

1. A compound comprising the structure:
θαYDINYYTSEβλ

wherein each T represents a threonine, each S
represents a serine, each E represents a glutamic
acid; each Y represents a tyrosine; each D represents
an aspartic acid, each I represents an isoleucine; and
each N represents an asparagine;

wherein α represents from 0 to 9 amino acids, with the
proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order
and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the I at position
9 and extending therefrom in the amino terminal
direction;

wherein β represents from 0 to 13 amino acids, with
the proviso that if there are more than 2 amino acids,
they are joined by peptide bonds in consecutive order
and have a sequence identical to the sequence set
forth in SEQ ID NO: 1 beginning with the P at position
19 and extending therefrom in the carboxy terminal
direction;

wherein θ represents an amino group or an acetylated
amino group; wherein λ represents a carboxyl group or
an amidated carboxyl group;

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined
together by peptide bonds;

further provided that at least two tyrosines in the
compound are sulfated.
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2. The compound of claim 1/ wherein 3 represents less 
than 17 amino acids. 

3. The compound of claim 1, wherein β represents less 
than 16 amino acids. 

4. The compound of claim 1, wherein β represents less 
than 15 amino acids. 

5. The compound of claim 1, wherein β represents less 
than 14 amino acids. 

6. The compound of claim 1, wherein β represents less 
than 13 amino acids. 

7. The compound of claim 1, wherein β represents less 
than 12 amino acids. 

8. The compound of claim 1, wherein β represents less 
than 11 amino acids. 

9. The compound of claim 1, wherein β represents less 
than 10 amino acids. 

10. The compound of claim 1, wherein β represents less 
than 9 amino acids. 

11. The compound of claim 1, wherein β represents less 
than 8 amino acids. 

12. The compound of claim 1, wherein β represents less 
than 7 amino acids. 

13. The compound of claim 1, wherein β represents less 
than 6 amino acids. 

14. The compound of claim 1, wherein β represents less 
than 5 amino acids. 

15. The compound of claim 1, wherein β represents less 
than 4 amino acids. 

16. The compound of claim 1, wherein β represents less 
than 3 amino acids. 

17. The compound of claim 1, wherein β represents less 
than 2 amino acids. 

18. The compound of claim 1, wherein β represents less 
than 1 amino acid. 

19. The compound of claim 1, wherein α represents less 
than 9 amino acids. 

20. The compound of claim 1, wherein α represents less 
than 8 amino acids. 

21. The compound of claim 1, wherein α represents less 
than 7 amino acids. 

22. The compound of claim 1, wherein α represents less 
than 6 amino acids. 

23. The compound of claim 1, wherein α represents less 
than 5 amino acids. 

24. The compound of claim 1, wherein α represents less 
than 4 amino acids. 

25. The compound of claim 1, wherein α represents less 
than 3 amino acids. 

26. The compound of claim 1, wherein α represents less 
than 2 amino acids. 

27. The compound of claim 1, wherein α represents less 
than 1 amino acid. 

28. A composition comprising the compound of claim 1 and a 
detectable marker attached thereto. 

29. The composition of claim 28, wherein the detectable 
marker is biotin. 

30. The composition of claim 28, wherein the detectable 
marker is attached at the C-terminus of the compound. 

31. A composition which comprises a carrier and an amount 
of the compound of claim 1 effective to inhibit 
binding of HIV-1 to a CCR5 receptor on the surface of 
a CD4+ cell. 

32. A method of inhibiting human immunodeficiency virus 
infection of a CD4+ cell which also carries a CCR5 
receptor on its surface which comprises contacting the 
CD4+ cell with an amount of the compound of claim 1 
effective to inhibit binding of human immunodeficiency 
virus to the CCR5 receptor so as to thereby inhibit 
human immunodeficiency virus infection of the CD4+ 
cell. 

33. The method of claim 32, wherein the CD4+ cell is 
present in a subject and the contacting is effected by 
administering the compound to the subject. 

34. A method of preventing CD4+ cells of a subject from 
becoming infected with human immunodeficiency virus 
which comprises administering to the subject an amount 
of the compound of claim 1 effective to inhibit 
binding of human immunodeficiency virus to CCR5 
receptors on the surface of the CD4+ cells so as to 
thereby prevent the subject's CD4+ cells from becoming 
infected with human immunodeficiency virus. 

35. A method of treating a subject whose CD4+ cells are 
infected with human immunodeficiency virus which 
comprises administering to the subject an amount of 
the compound of claim 1 effective to inhibit binding 
of human immunodeficiency virus to CCR5 receptors on 
the surface of the subject's CD4+ cells so as to 
thereby treat the subject. 

36. The method of any one of claims 33-35, wherein the 
compound is administered by aerosol, intravenous, oral 
or topical route. 



37. The method of claim 33 or 35, wherein the subject is 
infected with HIV-1 prior to administering the 
compound to the subject. 

38. The method of claim 33 or 34, wherein the subject is 
not infected with HIV-1 prior to administering the 
compound to the subject. 

39. The method of claim 38, wherein the subject is not 
infected with, but has been exposed to, human 
immunodeficiency virus. 

40. The method of any one of claims 33-35, wherein the 
effective amount of the compound comprises from about 
1.0 ng/kg to about 100 mg/kg body weight of the 
subject. 

41. The method of claim 40, wherein the effective amount 
of the compound comprises from about 100 ng/kg to 
about 50 mg/kg body weight of the subject. 

42. The method of claim 41, wherein the effective amount 
of the compound comprises from about 1 µg/kg to about 
10 mg/kg body weight of the subject. 

43. The method of claim 42, wherein the effective amount 
of the compound comprises from about 100 µg/kg to 
about 1 mg/kg body weight of the subject. 
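
As an illustration only, the per-kilogram ranges recited in claims 41 to 43 can be converted into total amounts for a given body weight. The sketch below is not part of the claims; the function name, the dictionary, and the 70 kg example weight are assumptions introduced here, and the recited bounds are approximate ("about").

```python
# Illustrative sketch only: names and the example weight are assumptions.
NG_PER_UG = 1_000
NG_PER_MG = 1_000_000

# (lower, upper) bounds in ng per kg body weight, as recited in claims 41-43.
DOSE_RANGES_NG_PER_KG = {
    41: (100, 50 * NG_PER_MG),              # about 100 ng/kg to about 50 mg/kg
    42: (1 * NG_PER_UG, 10 * NG_PER_MG),    # about 1 ug/kg to about 10 mg/kg
    43: (100 * NG_PER_UG, 1 * NG_PER_MG),   # about 100 ug/kg to about 1 mg/kg
}


def total_dose_bounds_ng(claim: int, body_weight_kg: int) -> tuple[int, int]:
    """Return the (lower, upper) total amount in ng for a subject of the given weight."""
    low, high = DOSE_RANGES_NG_PER_KG[claim]
    return low * body_weight_kg, high * body_weight_kg


def is_nested(inner: int, outer: int) -> bool:
    """True if the range of claim `inner` lies within the range of claim `outer`."""
    lo_in, hi_in = DOSE_RANGES_NG_PER_KG[inner]
    lo_out, hi_out = DOSE_RANGES_NG_PER_KG[outer]
    return lo_out <= lo_in and hi_in <= hi_out
```

Working in integer nanograms avoids floating-point rounding when the units span six orders of magnitude. Each dependent range falls inside the range of the claim it depends on.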

44. The method of any one of claims 33-35, wherein the 
subject is a human being. 

45. A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

(a) immobilizing the compound of claim 1 on a solid 
support; 

(b) contacting the immobilized compound from step (a) 
with sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on 
the immobilized compound under conditions 
permitting binding of the CCR5 ligand to the 
immobilized compound so as to form a complex; 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the 
agent; and 

(e) detecting whether any CCR5 ligand is displaced 
from the complex, wherein displacement of 
detectable CCR5 ligand from the complex indicates 
that the agent binds to the compound so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

46. A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

(a) contacting the compound of claim 1 with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 
ligand to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is 
bound to the compound in the complex; 

(d) contacting the complex from step (a) with the 
agent so as to displace CCR5 ligand from the 
complex; 

(e) measuring the amount of CCR5 ligand which is 
bound to the compound in the presence of the 
agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in 
step (c), wherein a reduced amount measured in 
step (e) indicates that the agent binds to the 
compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to 
the CCR5 receptor. 
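
The comparison in the final step of the preceding displacement method can be sketched numerically as follows. This is a minimal illustration, not a procedure taken from the specification: the function names, the signal values, and the optional `min_fraction` cutoff are all assumptions introduced here.

```python
# Illustrative sketch only: names, values and the cutoff are assumptions.

def fraction_displaced(bound_before: float, bound_after: float) -> float:
    """Fraction of bound ligand lost after adding the agent.

    bound_before: ligand bound to the compound before the agent is added.
    bound_after:  ligand bound to the compound in the presence of the agent.
    """
    if bound_before <= 0:
        raise ValueError("bound_before must be positive")
    return (bound_before - bound_after) / bound_before


def inhibits_binding(bound_before: float, bound_after: float,
                     min_fraction: float = 0.0) -> bool:
    """True if the amount bound with the agent is reduced by more than min_fraction."""
    return fraction_displaced(bound_before, bound_after) > min_fraction
```

In practice a nonzero `min_fraction` would be chosen from the assay's measurement noise; the default of zero simply mirrors the "reduced amount" wording.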

47. A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

(a) immobilizing the compound of claim 1 on a 
solid support; 

(b) contacting the immobilized compound from step (a) 
with the agent and detectable CCR5 ligand under 
conditions permitting binding of the CCR5 ligand 
to the immobilized compound so as to form a 
complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand 
which is bound to the immobilized compound in the 
complex; 

(e) measuring the amount of detectable CCR5 ligand 
which binds to the immobilized compound in the 
absence of the agent; 

(f) comparing the amount of CCR5 ligand which is 
bound to the immobilized compound in step (e) 
with the amount measured in step (d), wherein a 
reduced amount measured in step (d) indicates 
that the agent binds to the compound or CCR5 
ligand so as to thereby identify the agent as one 
which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

48. The method of claim 47, wherein the amount of the 
detectable ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

49. A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

(a) contacting the compound of claim 1 with the agent 
and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the 
compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand 
which is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand 
which binds to the compound in the absence of the 
agent; 

(e) comparing the amount of CCR5 ligand which is 
bound to the compound in step (c) with the amount 
measured in step (d), wherein a reduced amount 
measured in step (c) indicates that the agent 
binds to the compound or CCR5 ligand so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

50. The method of claim 49, wherein the amount of the 
detectable ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 



51. The method of any one of claims 45-50, wherein the 
detectable CCR5 ligand is labeled with a detectable 
marker. 



52. A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

a) immobilizing the compound of claim 1 on a solid 
support; 

b) contacting the immobilized compound from step a) 
with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 

c) contacting the immobilized compound from step a) 
with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c), 
wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

53. The method of claim 52, wherein the solid support is a 
surface plasmon resonance sensor chip. 

54. The method of claim 52 or 53, wherein the binding 
signal is measured by surface plasmon resonance. 

55. A method of obtaining a composition which comprises: 

(a) identifying a compound which inhibits binding of 
a CCR5 ligand to a CCR5 receptor according to the 
method of any one of claims 45-50 and 52; and 

(b) admixing the compound so identified or a homolog 
or derivative thereof with a carrier. 

56. The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a complex comprising an HIV-1 
envelope glycoprotein and a CD4-based protein. 

57. The method of claim 56, wherein the HIV-1 envelope 
glycoprotein is gp120, gp140 or gp160. 

58. The method of claim 56, wherein the CD4-based protein 
is soluble CD4 or CD4-IgG2. 

59. The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a chemokine. 

60. The method of claim 59, wherein the chemokine is 
RANTES, MIP-1α or MIP-1β. 

61. The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is an antibody. 

62. The method of claim 61, wherein the antibody is 
selected from the group consisting of PA8 (ATCC 
Accession No. HB-12605), PA10 (ATCC Accession 
No. HB-12607), PA11 (ATCC Accession No. HB-12608) and 
PA12 (ATCC Accession No. HB-12609). 

63. The method of claim 45 or 47, wherein the solid 
support is a microtiter plate well, a bead or surface 
plasmon resonance sensor chip. 

64. A compound having the structure: 

A-(αYDINYYTSEβλ)n 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine, each D represents 
an aspartic acid, each I represents an isoleucine, and 
each N represents an asparagine; 

wherein α represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; 

wherein β represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein λ represents a carboxyl group or an amidated 
carboxyl group; 

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8, A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

65. A compound having the structure: 

(θαYDINYYTSEβ)n-A 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine, each D represents 
an aspartic acid, each I represents an isoleucine, and 
each N represents an asparagine; 

wherein α represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; 

wherein β represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein θ represents an amino group or an acetylated 
amino group; 

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8, A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

66. The compound of claim 64 or 65, wherein the polymer is 
selected from the group consisting of a linear lysine 
polymer, a branched lysine polymer, a linear arginine 
polymer, a branched arginine polymer, polyethylene 
glycol, a linear acetylated lysine polymer, a branched 
acetylated lysine polymer, a linear chloroacetylated 
lysine polymer and a branched chloroacetylated lysine 
polymer. 

67. The compound of claim 1, wherein the compound is a 
peptide which comprises consecutive amino acids having 
the sequence YDINYYTSE. 

68. The compound of claim 67, wherein the tyrosines at 
positions 1 and 5 of the sequence YDINYYTSE are 
sulfated. 
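
The relationship between the core sequence YDINYYTSE, the flanking residues, and the sulfation requirement can be illustrated with a short sketch. It is not part of the claims. It assumes that the flanking residues are always taken from the native CCR5 amino terminus (residues 1 to 31 of SEQ ID NO: 1), which is stricter than the provisos, since those only constrain flanks of more than 2 amino acids. All function and variable names are hypothetical.

```python
# Illustrative sketch only: names are hypothetical, native flanks are assumed.

# Residues 1-31 of SEQ ID NO: 1 (CCR5 amino terminus) in one-letter code.
CCR5_NT = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"
CORE = "YDINYYTSE"   # residues 10-18 of SEQ ID NO: 1
CORE_START = 10      # 1-based position of the first core residue


def backbone(n_n_term: int, n_c_term: int) -> str:
    """Core sequence extended by native residues on each side.

    n_n_term: residues added toward the amino terminus (0 to 9).
    n_c_term: residues added toward the carboxy terminus (0 to 13).
    """
    if not 0 <= n_n_term <= 9:
        raise ValueError("N-terminal extension must be 0 to 9 residues")
    if not 0 <= n_c_term <= 13:
        raise ValueError("C-terminal extension must be 0 to 13 residues")
    first = CORE_START - n_n_term                  # 1-based, inclusive
    last = CORE_START + len(CORE) - 1 + n_c_term   # 1-based, inclusive
    return CCR5_NT[first - 1:last]


def tyrosine_positions(peptide: str) -> list[int]:
    """1-based positions of tyrosine (Y) within the peptide."""
    return [i + 1 for i, aa in enumerate(peptide) if aa == "Y"]


def has_two_sulfotyrosines(peptide: str, sulfated: set[int]) -> bool:
    """True if at least two of the listed positions are tyrosines in the peptide."""
    return len(set(tyrosine_positions(peptide)) & sulfated) >= 2
```

With no extension the function returns the nine-residue core, whose tyrosines at positions 1 and 5 correspond to Y10 and Y14 of CCR5.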

69. A compound comprising the structure: 

θαYDΠΠYΠΠΠEβλ 

wherein each E represents a glutamic acid, and each Y 
represents a tyrosine; 

wherein α represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein β represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein θ represents an amino group or an acetylated 
amino group; wherein λ represents a carboxyl group or 
an amidated carboxyl group; 

wherein Π represents any amino acid, 

wherein all of α, Y, D, Π, Π, Y, Π, Π, Π, E and β are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 

70. The compound of claim 69, wherein the compound is a 
peptide which comprises consecutive amino acids having 
the sequence YDΠΠYΠΠΠE. 

71. The compound of claim 70, wherein the tyrosines at 
positions 1 and 5 of the sequence YDΠΠYΠΠΠE are 
sulfated. 


72. A compound comprising the structure: 

θαYDINYYTSEβλ 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine, each D represents 
an aspartic acid, each I represents an isoleucine, and 
each N represents an asparagine; 

wherein α represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein β represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein θ represents an amino group or an acetylated 
amino group; wherein λ represents a carboxyl group or 
an amidated carboxyl group; 

wherein all of α, Y, D, I, N, Y, Y, T, S, E and β are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, 
D at position 2, Y at position 5 and E at position 9 
may be replaced with a homologous amino acid. 

73. The compound of claim 72, wherein any I amino acid 
residue is replaced with a G, A, V or L amino acid 
residue. 

74. The compound of claim 72, wherein any N amino acid 
residue is replaced with a Q amino acid residue. 

75. The compound of claim 72, wherein any Y amino acid 
residue is replaced with an F or W amino acid residue. 

76. The compound of claim 72, wherein any T amino acid 
residue is replaced with an S amino acid residue. 

77. The compound of claim 72, wherein any S is replaced 
with a T amino acid residue. 

78. The compound of claim 72, wherein any C is replaced 
with an S, T, A, G, N, or Q amino acid residue. 

SEQUENCE LISTING 

<110> Progenics Pharmaceuticals, Inc., et al. 

<120> SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 

<130> 2048/61010-A-PCT/JPW/SHS/AX 

<140> NOT YET KNOWN 

<141> 2001-02-28 

<160> 17 

<170> PatentIn version 3.0 

<210> 1 
<211> 352 
<212> PRT 
<213> human 
<400> 1 

Met Asp Tyr Gln Val Ser Ser Pro Ile Tyr Asp Ile Asn Tyr Tyr Thr
1               5                   10                  15

Ser Glu Pro Cys Gln Lys Ile Asn Val Lys Gln Ile Ala Ala Arg Leu
            20                  25                  30

Leu Pro Pro Leu Tyr Ser Leu Val Phe Ile Phe Gly Phe Val Gly Asn
        35                  40                  45

Met Leu Val Ile Leu Ile Leu Ile Asn Cys Lys Arg Leu Lys Ser Met
    50                  55                  60

Thr Asp Ile Tyr Leu Leu Asn Leu Ala Ile Ser Asp Leu Phe Phe Leu
65                  70                  75                  80

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gln Trp Asp Phe
                85                  90                  95

Gly Asn Thr Met Cys Gln Leu Leu Thr Gly Leu Tyr Phe Ile Gly Phe
            100                 105                 110

Phe Ser Gly Ile Phe Phe Ile Ile Leu Leu Thr Ile Asp Arg Tyr Leu
        115                 120                 125

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe
    130                 135                 140

Gly Val Val Thr Ser Val Ile Thr Trp Val Val Ala Val Phe Ala Ser
145                 150                 155                 160

Leu Pro Gly Ile Ile Phe Thr Arg Ser Gln Lys Glu Gly Leu His Tyr
                165                 170                 175

Thr Cys Ser Ser His Phe Pro Tyr Ser Gln Tyr Gln Phe Trp Lys Asn
            180                 185                 190

Phe Gln Thr Leu Lys Ile Val Ile Leu Gly Leu Val Leu Pro Leu Leu
        195                 200                 205

Val Met Val Ile Cys Tyr Ser Gly Ile Leu Lys Thr Leu Leu Arg Cys
    210                 215                 220

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu Ile Phe Thr Ile
225                 230                 235                 240

Met Ile Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn Ile Val Leu Leu
                245                 250                 255

Leu Asn Thr Phe Gln Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser
            260                 265                 270

Asn Arg Leu Asp Gln Ala Met Gln Val Thr Glu Thr Leu Gly Met Thr
        275                 280                 285

His Cys Cys Ile Asn Pro Ile Ile Tyr Ala Phe Val Gly Glu Lys Phe
    290                 295                 300

Arg Asn Tyr Leu Leu Val Phe Phe Gln Lys His Ile Ala Lys Arg Phe
305                 310                 315                 320

Cys Lys Cys Cys Ser Ile Phe Gln Gln Glu Ala Pro Glu Arg Ala Ser
                325                 330                 335

Ser Val Tyr Thr Arg Ser Thr Gly Glu Gln Glu Ile Ser Val Gly Leu
            340                 345                 350

<210> 2 
<211> 1376 
<212> DNA 
<213> human 

<400> 2 

gaattccccc aacagagcca agctctccat ctagtggaca gggaagctag cagcaaacct 60 
tcccttcact acaaaacttc attgcttggc caaaaagaga gttaattcaa tgtagacatc 120 
tatgtaggca attaaaaacc tattgatgta taaaacagtt tgcattcatg gagggcaact 180 
aaatacattc taggacttta taaaagatca ctttttattt atgcacaggg tggaacaaga 240 
tggattatca agtgtcaagt ccaatctatg acatcaatta ttatacatcg gagccctgcc 300 
aaaaaatcaa tgtgaagcaa atcgcagccc gcctcctgcc tccgctctac tcactggtgt 360 
tcatctttgg ttttgtgggc aacatgctgg tcatcctcat cctgataaac tgcaaaaggc 420 
tgaagagcat gactgacatc tacctgctca acctggccat ctctgacctg tttttccttc 480 
ttactgtccc cttctgggct cactatgctg ccgcccagtg ggactttgga aatacaatgt 540 
gtcaactctt gacagggctc tattttatag gcttcttctc tggaatcttc ttcatcatcc 600 
tcctgacaat cgataggtac ctggctgtcg tccatgctgt gtttgcttta aaagccagga 660 
cggtcacctt tggggtggtg acaagtgtga tcacttgggt ggtggctgtg tttgcgtctc 720 
tcccaggaat catctttacc agatctcaaa aagaaggtct tcattacacc tgcagctctc 780 
attttccata cagtcagtat caattctgga agaatttcca gacattaaag atagtcatct 840 
tggggctggt cctgccgctg cttgtcatgg tcatctgcta ctcgggaatc ctaaaaactc 900 
tgcttcggtg tcgaaatgag aagaagaggc acagggctgt gaggcttatc ttcaccatca 960 
tgattgttta ttttctcttc tgggctccct acaacattgt ccttctcctg aacaccttcc 1020 
aggaattctt tggcctgaat aattgcagta gctctaacag gttggaccaa gctatgcagg 1080 
tgacagagac tcttgggatg acgcactgct gcatcaaccc catcatctat gcctttgtcg 1140 
gggagaagtt cagaaactac ctcttagtct tcttccaaaa gcacattgcc aaacgcttct 1200 
gcaaatgctg ttctattttc cagcaagagg ctcccgagcg agcaagctca gtttacaccc 1260 
gatccactgg ggagcaggaa atatctgtgg gcttgtgaca cggactcaag tgggctggtg 1320 
acccagtcag agttgtgcac atggcttagt tttcatacac agcctgggct gggggt 1376 
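
As a consistency check between the two CCR5 sequences, the start of the open reading frame in SEQ ID NO: 2 can be translated and compared with the amino terminus of SEQ ID NO: 1. The sketch below is illustrative only: it uses the standard genetic code, the 60-nucleotide fragment is copied from nucleotides 240 to 299 of the listing above, and the function and variable names are assumptions introduced here.

```python
# Illustrative sketch only: names are assumptions; standard genetic code is used.

BASES = "tcag"
# Amino acids for the 64 codons in t, c, a, g order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from its first base, stopping at a stop codon."""
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Nucleotides 240-299 of SEQ ID NO: 2, beginning with the atg start codon.
ORF_START = "atggattatcaagtgtcaagtccaatctatgacatcaattattatacatcggagccctgc"

n_terminus = translate(ORF_START)
```

The translated fragment covers residues 1 to 20 of SEQ ID NO: 1, so the slice `n_terminus[9:18]` corresponds to residues 10 to 18.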

<210> 3 
<211> 49 
<212> PRT 

<213> human immunodeficiency virus 
<400> 3 

Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg
1               5                   10                  15

Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile
            20                  25                  30

Lys Gln Leu Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp
        35                  40                  45

Gln 

<210> 4 
<211> 35 
<212> PRT 
<213> human immunodeficiency virus 
<400> 4 

Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile His
1               5                   10                  15

Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu
            20                  25                  30

Leu Leu Glu
        35

<210> 5 
<211> 22 
<212> PRT 
<213> human 

<400> 5 

Leu Leu Thr Val Glu Gln Ala Leu Ala Asp Phe Ala Glu Leu Leu Arg
1               5                   10                  15

Ala Leu Arg Arg Asp Leu
            20

<210> 6 
<211> 33 
<212> PRT 
<213> human 
<400> 6 

His Met Lys Gln Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn
1               5                   10                  15

Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu
            20                  25                  30

Arg 

<210> 7 
<211> 33 
<212> PRT 
<213> human 

<400> 7 

His Met Lys Gln Ile Glu Asp Lys Ile Glu Glu Ile Leu Ser Lys Ile
1               5                   10                  15

Tyr His Ile Glu Asn Glu Ile Ala Arg Ile Lys Lys Leu Ile Gly Glu
            20                  25                  30

Val 

<210> 8 
<211> 40 
<212> PRT 
<213> human 

<400> 8 

Leu Thr Asp Thr Leu Gln Ala Glu Thr Asp Gln Leu Glu Asp Glu Lys
1               5                   10                  15

Ser Ala Leu Gln Thr Glu Ile Ala Asn Leu Leu Lys Glu Lys Glu Lys
            20                  25                  30

Leu Glu Phe Ile Leu Ala Ala Arg
        35                  40

<210> 9 
<211> 40 
<212> PRT 
<213> human 

<400> 9 

His Met Arg Arg Ile Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys
1               5                   10                  15

Ala Gln Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu Arg Glu Gln
            20                  25                  30

Val Ala Gln Leu Lys Gln Lys Tyr
        35                  40



<210> 10 
<211> 36 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(36) 
<223> T-20 

<400> 10 

Tyr Thr Ser Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln
1               5                   10                  15

Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu
            20                  25                  30

Trp Asn Trp Phe
        35

<210> 11 
<211> 38 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(38) 
<223> DP107 

<400> 11 

Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu
1               5                   10                  15

Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile Leu Ala Val Glu
            20                  25                  30

Arg Tyr Leu Lys Asp Gln
        35

<210> 12 
<211> 34 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(34) 
<223> N34 

<400> 12 

Ser Gly Ile Val Gln Glu Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala
1               5                   10                  15

Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln
            20                  25                  30

Ala Arg 

<210> 13 
<211> 28 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(28) 
<223> C28 

<400> 13 

Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile His
1               5                   10                  15

Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys
            20                  25

<210> 14 
<211> 68 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(68) 
<223> N34(L6)C28 

<400> 14 

Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala
1               5                   10                  15

Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln
            20                  25                  30

Ala Arg Ser Gly Gly Arg Gly Gly Trp Met Glu Trp Asp Arg Glu Ile
        35                  40                  45

Asn Asn Tyr Thr Ser Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn
    50                  55                  60

Gln Gln Glu Lys
65

<210> 15 
<211> 39 
<212> PRT 
<213> unknown 

<220> 
<221> PEPTIDE 
<222> (1)..(39) 
<223> T1249 

<400> 15 

Trp Gln Glu Trp Glu Gln Lys Ile Thr Ala Leu Leu Glu Gln Ala Gln
1               5                   10                  15

Ile Gln Gln Glu Lys Asn Glu Tyr Glu Leu Gln Lys Leu Asp Lys Trp
            20                  25                  30

Ala Ser Leu Trp Glu Trp Phe
        35

<210> 16 
<211> 502 
<212> PRT 

<213> human immunodeficiency virus 



<400> 16 

Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly
1               5                   10                  15

Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys
            20                  25                  30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
    50                  55                  60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80

Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys
                85                  90                  95

Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125

Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly
    130                 135                 140

Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr
145                 150                 155                 160

Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
                165                 170                 175

Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser
            180                 185                 190

Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu
        195                 200                 205

Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
    210                 215                 220

Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser
225                 230                 235                 240

Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
                245                 250                 255

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp
            260                 265                 270

Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser
        275                 280                 285

Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile
    290                 295                 300

His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly
305                 310                 315                 320

Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp
                325                 330                 335

Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys
            340                 345                 350

Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met
        355                 360                 365

His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln
    370                 375                 380



